Transcripts

The Quartr API provides access to transcripts of company events. These transcripts offer a detailed textual representation of event audio. Below is an overview of the transcript data format.

Transcript Data Format

Transcripts are accessible through the event object in the API. Each event has a transcript object, and its transcriptUrl field points to the transcript data. See the API documentation for more details. The structure of the transcript data is as follows:

  • version: A string indicating the version of the transcript format. Currently set to "1".

  • event_id: The Quartr ID of the event.

  • company_id: The Quartr company ID of the company related to the event.

  • transcript: An object containing the detailed transcript data, structured as:

    • text: The full transcript text of the event.

    • number_of_speakers: An integer indicating the total number of speakers in the event.

    • paragraphs: An array of paragraph objects, each containing:

      • text: The text of the paragraph.

      • start: The start time of the paragraph in seconds.

      • end: The end time of the paragraph in seconds.

      • sentences: An array of sentence objects within the paragraph, each containing:

        • text: The text of the sentence.

        • start: The start time of the sentence in seconds.

        • end: The end time of the sentence in seconds.

        • speaker: An identifier for the speaker (if available).

        • words: An array of word objects in the sentence, each with:

          • word: The word in lowercase.

          • punctuated_word: The word with original punctuation.

          • start: The start time of the word in seconds.

          • end: The end time of the word in seconds.

          • confidence: A float representing the confidence level of the transcription for this word.
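
The field list above can be mirrored as typed Python structures. The following is a minimal sketch, not an official client: the class names and the load_transcript helper are hypothetical, and the Python types (for example, integer IDs and an integer speaker) are inferred from the example values in this document rather than from a published schema.

```python
import json
from typing import List, Optional, TypedDict


class Word(TypedDict):
    word: str             # lowercase form
    punctuated_word: str  # form with original punctuation
    start: float          # seconds
    end: float            # seconds
    confidence: float


class Sentence(TypedDict):
    text: str
    start: float
    end: float
    speaker: Optional[int]  # assumption: an integer when available
    words: List[Word]


class Paragraph(TypedDict):
    text: str
    start: float
    end: float
    sentences: List[Sentence]


class Transcript(TypedDict):
    text: str
    number_of_speakers: int
    paragraphs: List[Paragraph]


class TranscriptFile(TypedDict):
    version: str
    event_id: int
    company_id: int
    transcript: Transcript


REQUIRED_KEYS = {"version", "event_id", "company_id", "transcript"}


def load_transcript(raw: str) -> TranscriptFile:
    """Parse transcript JSON text and check the top-level keys listed above."""
    data = json.loads(raw)
    missing = REQUIRED_KEYS - set(data)
    if missing:
        raise ValueError(f"transcript is missing keys: {sorted(missing)}")
    return data
```

TypedDict adds no runtime validation beyond the explicit key check; it only gives editors and type checkers a description of the nesting.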

Example JSON Structure (abridged; the // comments mark omitted items and are not part of the actual JSON):

{
    "version": "1",
    "event_id": 84084,
    "company_id": 5710,
    "transcript": {
        "text": "Full transcript text here...",
        "number_of_speakers": 12,
        "paragraphs": [
            {
                "text": "Paragraph text here...",
                "start": 0.0,
                "end": 42.565,
                "sentences": [
                    {
                        "text": "Sentence text here...",
                        "start": 0.0,
                        "end": 2.6599998,
                        "speaker": 0,
                        "words": [
                            {
                                "word": "word1",
                                "punctuated_word": "Word1",
                                "start": 0.0,
                                "end": 0.48,
                                "confidence": 1.0
                            },
                            // More words...
                        ]
                    },
                    // More sentences...
                ]
            },
            // More paragraphs...
        ]
    }
}
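
Once parsed, the nested structure can be walked to answer common questions, such as who spoke when or which words may have been transcribed unreliably. The sketch below assumes the transcript has already been loaded into a Python dict with the shape shown above. The function names are hypothetical, and the 0.8 confidence threshold is an arbitrary illustration, not a value recommended by Quartr.

```python
from typing import Any, Dict, Iterator, List, Tuple


def iter_sentences(doc: Dict[str, Any]) -> Iterator[Dict[str, Any]]:
    """Yield every sentence object in document order."""
    for paragraph in doc["transcript"]["paragraphs"]:
        yield from paragraph["sentences"]


def speaker_turns(doc: Dict[str, Any]) -> List[Dict[str, Any]]:
    """Merge consecutive sentences with the same speaker into turns.

    Sentences without a speaker are grouped under None.
    """
    turns: List[Dict[str, Any]] = []
    for sentence in iter_sentences(doc):
        speaker = sentence.get("speaker")  # the field may be absent
        if turns and turns[-1]["speaker"] == speaker:
            # Same speaker continues: extend the current turn.
            turns[-1]["text"] += " " + sentence["text"]
            turns[-1]["end"] = sentence["end"]
        else:
            turns.append({
                "speaker": speaker,
                "start": sentence["start"],
                "end": sentence["end"],
                "text": sentence["text"],
            })
    return turns


def low_confidence_words(
    doc: Dict[str, Any], threshold: float = 0.8
) -> List[Tuple[str, float]]:
    """Return (punctuated_word, start) for words below the threshold."""
    return [
        (word["punctuated_word"], word["start"])
        for sentence in iter_sentences(doc)
        for word in sentence["words"]
        if word["confidence"] < threshold
    ]
```

Turns are built from sentences rather than paragraphs because the speaker field lives on the sentence object, so a single paragraph is not guaranteed to have one speaker.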