Options
All
  • Public
  • Public/Protected
  • All
Menu

This is an object representing a transcription. You can create them, retrieve them to see their status and results, and delete them.

Hierarchy

Index

Constructors

Properties

audio_duration?: number

The duration of your media file, in seconds.

audio_end_at?: number

The point in time, in milliseconds, to stop transcribing in your media file.

audio_start_from?: number

The point in time, in milliseconds, to begin transcription from in your media file.

audio_url: string

The URL of your media file to transcribe.

auto_chapters?: boolean

Enable Auto Chapters, can be true or false.

auto_highlights?: boolean

Enable Automatic Transcript Highlights, can be true or false.

auto_highlights_result?: AutoHighlightsResult

The list of results when enabling Automatic Transcript Highlights.

boost_param?: string

The weight to apply to words/phrases in the word_boost array; can be "low", "default", or "high".

chapters?: Chapter[]

When Auto Chapters is enabled, the list of Auto Chapters results.

confidence?: number

The confidence our model has in the transcribed text, between 0.0 and 1.0.

content_safety?: boolean

Enable Content Safety Detection, can be true or false.

content_safety_labels?: ContentSafetyLabels

The list of results when TranscriptRequest.content_safety is true.

custom_spelling?: CustomSpelling[]

Customize how words are spelled and formatted using to and from values.

disfluencies?: boolean

Transcribe Filler Words, like "umm", in your media file; can be true or false.

dual_channel?: boolean

Enable Dual Channel transcription, can be true or false.

entities?: Entity[]

When Entity Detection is enabled, the list of detected Entities.

entity_detection?: boolean

Enable Entity Detection, can be true or false.

error?: string

The error message if the transcript status is error.

filter_profanity?: boolean

Filter profanity from the transcribed text, can be true or false.

format_text?: boolean

Enable Text Formatting, can be true or false.

iab_categories?: boolean

Enable Topic Detection, can be true or false.

iab_categories_result?: IabCategoriesResult

Enable Topic Detection, can be true or false.

id?: string

The unique identifier of your transcription.

language_code?: string

The language of your audio file. Possible values are found in Supported Languages. The default value is en_us.

language_detection?: boolean

Enable Automatic Language Detection, can be true or false.


The Automatic Language Detection feature can identify the dominant language that’s spoken in an audio file, and route the file to the appropriate model for the detected language.

Note - Automatic Language Detection supports detecting English, French, German, Italian, and Spanish currently. We will be adding support for more languages over time.

To enable this feature, include the language_detection parameter with a value of true in your POST request when submitting a file for processing.

If you know the language of the spoken audio in a file, you can specify that in your POST request as shown in the documentation for Specifying a Language.

Heads Up - In order to reliably identify the dominant language in a file, the model needs approximately 50 seconds of spoken audio in that language over the course of the audio file.

punctuate?: boolean

Enable Automatic Punctuation, can be true or false.

redact_pii?: boolean

Redact PII from the transcribed text, can be true or false.

With PII Redaction, the API can automatically remove Personally Identifiable Information (PII), such as phone numbers and social security numbers, from the transcription text before it is returned to you.

All redacted text will be replaced with # characters. For example, if the phone number 111-2222 was spoken in the audio, it would be transcribed as ###-#### in the text.

redact_pii_audio?: boolean

Generate a copy of the original media file with spoken PII "beeped" out, can be true or false.

redact_pii_policies?: string[]

The list of PII Redaction policies to enable.

To best-fit PII Redaction to your use case and data, you can select from a set of redaction policies when using PII Redaction. Simply include any or some of the below policy names in the redact_pii_policies array when making your POST request as shown on the right.

Policy Name Description
medical_process Medical process, including treatments, procedures, and tests (e.g., heart surgery, CT scan)
medical_condition Name of a medical condition, disease, syndrome, deficit, or disorder (e.g., chronic fatigue syndrome, arrhythmia, depression)
blood_type Blood type (e.g., O-, AB positive)
drug Medications, vitamins, or supplements (e.g., Advil, Acetaminophen, Panadol)
injury Bodily injury (e.g., I broke my arm, I have a sprained wrist)
number_sequence A "lazy" rule that will redact any sequence of numbers equal to or greater than 2
email_address Email address (e.g., support@assemblyai.com))
date_of_birth Date of Birth (e.g., Date of Birth: March 7,1961)
phone_number Telephone or fax number
us_social_security_number Social Security Number or equivalent
credit_card_number Credit card number
credit_card_expiration Expiration date of a credit card
credit_card_cvv Credit card verification code (e.g., CVV: 080)
date Specific calendar date (e.g., December 18)
nationality Terms indicating nationality, ethnicity, or race (e.g., American, Asian, Caucasian)
event Name of an event or holiday (e.g., Olympics, Yom Kippur)
language Name of a natural language (e.g., Spanish, French)
location Any Location reference including mailing address, postal code, city, state, province, or country
money_amount Name and/or amount of currency (e.g., 15 pesos, $94.50)
person_name Name of a person (e.g., Bob, Doug Jones)
person_age Number associated with an age (e.g., 27, 75)
organization Name of an organization (e.g., CNN, McDonalds, University of Alaska)
political_affiliation Terms referring to a political party, movement, or ideology (e.g., Republican, Liberal)
occupation Job title or profession (e.g., professor, actors, engineer, CPA)
religion Terms indicating religious affiliation (e.g., Hindu, Catholic)
drivers_license Driver’s license number (e.g., DL# 356933-540)
banking_information Banking information, including account and routing numbers
redact_pii_sub?: string

The replacement logic for detected PII, can be "entity_type" or "hash".

sentiment_analysis?: boolean

Enable Sentiment Analysis, can be true or false.

sentiment_analysis_results?: SentimentAnalysisResult[]

When Sentiment Analysis is enabled, the list of Sentiment Analysis results.

speaker_labels?: boolean

Enable Speaker Diarization, can be true or false.

status?: string

The status of your transcription. queued, processing, completed, or error

text?: string

The text transcription of your media file.

utterances?: Utterance[]

When dual_channel or speaker_labels is enabled, a list of turn-by-turn utterances.

webhook_status_code?: string

The status code we received from your server when delivering your webhook.

webhook_url?: string

The URL we should send webhooks to when your transcript is complete.

word_boost?: string[]

A list of custom vocabulary to boost accuracy for.

words?: Word[]

A list of all the individual words transcribed.

Generated using TypeDoc