First things first
An overview of projectsHow to use the project readme Take notes and upload dataTranscribe video and audio
Get started with transcriptionChange and rename speakersMerge and split monologuesKeyboard shortcutsSupported file formatsSupported languagesTips for optimal resultsCustom vocabularyHow our model is trainedTranscription pricing
Format your notes and transcriptsFormatting and keyboard shortcutsAdd structured data with fields
More articles
Help homeThe basicsFirst things first

Transcribe video and audio

Dovetail’s built-in transcription and video highlights are a powerful way to share stories about your research and develop a repository of searchable audio and video clips.

Upload a video or audio recording you’ve taken from an interview, usability test, sales call, or product demo. Dovetail will process the recording into a fast, streamable format, and transcribe it using an advanced AI-powered speech engine. Then, create highlights (Highlight and tag project content) to turn your raw recording into tagged, searchable audio and video clips.

Get started with transcription

To upload a video or audio recording:

  1. Click Data in the sidebar.

  2. Click + Add data.

  3. Click the Video or Audio icon.

  4. Choose a file from your computer.

  5. Next to Transcribe this file, click Begin.

Your recording will be uploaded into a note and processed to ensure fast playback. The amount of time this takes depends on the length of the recording. In general, processing takes about 30% of the length of the file; e.g. a 60 minute recording will take approximately 20 minutes to process and transcribe.

You can close the note and continue using other parts of Dovetail while your file is uploading. You can safely leave Dovetail entirely (e.g. close the browser window or turn off your computer) while it’s processing and being transcribed, and come back later.

Change and rename speakers

While our AI speech-engine will attempt to automatically detect different speakers, it sometimes doesn’t get it right. You can change and rename the speaker for a monologue (a period of speech by one person) by clicking their name, which defaults to Speaker 1 and Speaker 2. Changing the speaker for a monologue only changes it there, but renaming a speaker renames it everywhere it's used.

Merge and split monologues

You can merge two monologues into one by placing your cursor at the start of the second monologue and pressing backspace. Similarly, you can split one monologue into two by placing your cursor inside and pressing enter.

Keyboard shortcuts

Dovetail provides a number of keyboard shortcuts to control the playback for audio and video. To see them, click the Actions (···) menu in the top right of a note, tag, or insight, then Shortcuts.

Supported file formats

Dovetail supports the following file formats:

Video formats:

  • mp4

  • mov

  • mpeg

  • avi

Audio formats:

  • mp3

  • m4a

  • wav

Supported languages

At the moment, transcription only supports English.

Tips for optimal results

Here are a few tips to improve the quality of your recordings and transcript:

  • Record in a quiet setting with minimal background noise.

  • Invest in quality recording equipment, such as a microphone or recorder.

  • Speak clearly, loudly, and slowly.

  • Avoid talking over other people.

Custom vocabulary

To improve the accuracy of transcripts you can submit a list of custom words or phrases that are not found in a dictionary (for example company names or industry jargon) before starting a transcription.

You can paste a comma separated list of terms that you want to include in the transcription dialog to avoid having to type out the terms every time.

How our model is trained

The AI speech engine we use is trained on 50,000+ hours of human-transcribed content across a diversity of topics, industries, and accents. This makes our transcripts some of the most accurate available.

Transcription pricing

Transcription and video highlights is free to use while this feature is in beta. After beta, all paid plans will include transcription minutes, and additional minutes can be purchased for $0.20 USD per minute.

Was this article useful?
Related articles
Import and export
Upload images, audio, and video
Kai Forsyth
Product Marketer
David Richard
Lead Developer
Article info
Last updated 5 November 2020
3 min read

Get help

Can’t find what you’re looking for? Search through our articles or contact our support team and get a response within 24 hours.

Get help