1. Начало
  2. Транскрибиране на аудио и видео
  3. Transcription Glossary: From Basic Terms to Advanced Software

Transcription Glossary: From Basic Terms to Advanced Software

Cliff Weitzman

Клиф Вайцман

Главен изпълнителен директор и основател на Speechify

#1 AI генератор на глас
Създавайте записи с човешко звучене
в реално време.

apple logoApple Design Award 2025
50M+ потребители

Here is a Glossary of the Top 96 Transcription Terms - in Alphabetical Order

1. Acoustic Model: Represents the relationship between an audio signal and the phonetic units in a language.

2. Alignment: Ensuring that the transcribed text matches the audio.

3. Audio File: A file format containing recorded speech or sounds.

4. Automatic Speech Recognition (ASR): Technology that converts spoken language into written text.

5. Background Noise: Undesired or irrelevant sounds in the background of an audio file.

6. Billing Hour: Charging based on the actual time spent transcribing.

7. Billing Minute: Charging based on the total length of the audio or video.

8. Bit Rate: The number of bits processed per unit of time in an audio file.

9. CAPTCHA: System designed to distinguish human from machine input.

10. Clean Read: Editing out stammers, repetitions, and non-standard language.

11. Closed Captioning: The display of text on a video screen to provide additional information.

12. Codec: Program or device that compresses and decompresses digital audio data.

13. Confidence Score: In speech recognition, a measure of the system's certainty.

14. Confidentiality Agreement: Contract ensuring transcriptionist non-disclosure.

15. Crosstalk: When two or more people talk simultaneously.

16. Decibel (dB): Unit measuring sound intensity.

17. Diarization Error Rate (DER): Metric measuring speaker diarization accuracy.

18. Dictation: Speaking aloud for transcription.

19. Dictation Device: Device used to record speech for transcription.

20. DSS (Digital Speech Standard): A proprietary compressed digital audio file format.

21. Editing: Correcting and refining a transcript.

22. Equalization (EQ): Adjusting audio signal frequency components.

23. Feedback: Noise or interference in an audio recording.

24. Feedback Loop: Learning from corrections in transcription.

25. Filler Words: Words like "um", "uh" used as speech pauses.

26. Foot Pedal: Tool for hands-free playback control during transcription.

27. Focus Group Transcription: Transcribing focus group discussions.

28. Full Disclosure: Revealing everything heard without omissions.

29. Grammar Check: Ensuring proper grammar in transcribed text.

30. Hotkey: Keyboard shortcut in transcription software.

31. Human Review: Manually checking and correcting a transcript.

32. Inaudible: Unclear section of audio.

33. Insert: Adding extra text for clarity.

34. Interview Transcription: Transcribing interview content.

35. Language Model: Predicts word sequence probability in speech recognition.

36. Legal Transcription: Transcribing legal content.

37. Loop Listening: Repeatedly listening to an audio segment for accuracy.

38. Markup: Notations added to indicate specific transcript formatting.

39. Machine Transcription (MT): Using software to transcribe speech.

40. Medical Transcription: Transcribing medical dictations or recordings.

41. Mono: Single-channel sound recording and reproduction.

42. MP3 File: Digital audio format with lossy compression.

43. Noise Profile: Characteristics of consistent background noise in audio.

44. Noise Reduction: Techniques to minimize unwanted sounds in recordings.

45. NDA (Non-Disclosure Agreement): Contract ensuring confidentiality.

46. Outsource: Hiring external services like transcription outside the primary entity.

47. Overlapping Dialogue: When speakers talk over one another.

48. Paralinguistics: Non-lexical speech features, e.g., tone, pitch.

49. Phrase List: List of common phrases appearing in audio.

50. Pitch: Perceived frequency of a sound.

51. Playback Speed: Speed at which audio plays back.

52. Podcast Transcription: Transcribing podcast content.

53. Pop Filter: Screen used to minimize popping sounds in recording.

54. Proofreading: Final transcript review for mistakes.

55. Punctuation: Marks clarifying meaning in written text.

56. QC (Quality Control): Reviewing transcription accuracy.

57. Raw Transcript: Initial draft before editing or formatting.

58. Recording Environment: Setting where audio recording occurs.

59. Research Transcription: Transcribing academic or professional research recordings.

60. Rev: Popular online transcription service.

61. Rate: Fee charged for transcription, often per audio minute.

62. Rush Job: Transcription task with a short completion timeframe.

63. Sample Rate: Number of audio samples carried per second.

64. Scoping: Editing and formatting raw stenographic notes.

65. Segment: A section of an audio file.

66. Shorthand: Rapid handwriting system using symbols for words or phrases.

67. Speaker Diarization: Distinguishing and labeling different speakers in audio.

68. Speaker Identification: Labeling different conversation speakers.

69. Speech Recognition Software: Converts spoken language into text.

70. Steno Machine: Specialized keyboard for shorthand typing.

71. Stenography: Writing in shorthand.

72. Stereo: Two-channel sound recording and reproduction.

73. Subtitle: Translations or dialogue interpretations in movies or broadcasts.

74. SRT (SubRip Subtitle file): File format storing subtitles or transcript timings.

75. Sync: Ensuring transcribed text matches video spoken words.

76. Teleconference Transcription: Transcribing virtual meeting content.

77. Template: Preset format for transcriptions.

78. Time Code: Markers indicating hours, minutes, seconds, and frames.

79. Time-Coding: Inserting transcript time markers for specific segments.

80. Timestamp: Marker indicating when a word or phrase is spoken in audio.

81. Timestamping: Inserting time codes in transcripts at regular intervals or new speaker instances.

82. TranscribeMe: Popular online transcription service.

83. Transcription: Converting spoken language into written text.

84. Transcription Ethics: Moral principles guiding the transcription profession.

85. Transcription Kit: Tools and software package for professional transcriptionists.

86. Transcription Platform: Online tool or platform for transcription work.

87. Transcription Software: Programs facilitating the transcription process.

88. Transcriptionist: Person converting audio or video content into text.

89. Transcript: Resulting written document from transcription.

90. Turnaround Time: Time taken or promised to complete transcription.

91. Verbatim: Transcribing exactly as spoken.

92. Voice Recognition: Identifying a person based on vocal attributes.

93. Voice Writing: Repeating spoken words into a speech recognition system for transcription.

94. WAV File: Common audio file format.

95. Word Error Rate (WER): Metric measuring transcribed words' accuracy against original audio.

96. Work Queue: List of transcription tasks for a transcriptionist.

Frequently Asked Questions

What is an example of a transcription?

Transcription is the process of converting audio recordings, such as a podcast or a Zoom meeting, into a text file. For instance, if a professor gives a lecture on the topic of mRNA (messenger RNA) and its role in gene expression, a transcriptionist might convert that lecture into a written document.

How should you transcribe words?

When transcribing, words should be written verbatim. If a speaker uses filler words like "um" and "uh", they should be included. The transcriptionist should be fluent in the language of the recording, whether it’s English, French, or Latin. Real-time transcription requires the transcriptionist to type rapidly while listening.

How do you use glossary terms on Rev?

Rev, a popular transcription platform, provides tutorials on how to effectively use their transcription glossary in their workspace. It ensures uniformity in terms, especially when transcribing niche topics like healthcare or genome studies.

What is the vocabulary of transcription?

The vocabulary of transcription includes terms like "subtitles", "template", "playback", and "automated transcription". It's crucial for a transcriptionist transitioning into the field to familiarize themselves with this glossary.

How do you use a glossary?

A glossary is used as a reference guide. When encountering a term or symbol one is unfamiliar with, they can refer to the glossary. It’s especially useful in fields like gene expression where terms like "promoter" might be used.

What are the transcription symbols?

Transcription symbols are a set of standardized symbols used to represent specific sounds or events in audio files. They might indicate things like interrupted speech, overlapping speech, or background noises.

What is the difference between a glossary and a transcription?

A glossary is a list of terms with definitions, used for reference. A transcription is the written representation of spoken words from audio recordings.

Top 8 Transcription Software/Apps:

  1. Rev: Known for its fast service and competitive pricing, also offers an API for developers.
  2. Otter.ai: Real-time speech recognition for instant transcription. It offers automated transcription and integrates with Zoom.
  3. Scribie: Focuses on accurate transcriptions using a combination of automated workflows and human review.
  4. Temi: Uses advanced speech recognition for automated transcription, with a quick turnaround.
  5. Sonix: Offers transcription in multiple languages and integrates with various social media platforms.
  6. Trint: Provides real-time transcriptions and offers a workspace for collaborative editing.
  7. Descript: Merges audio editing with transcription, ideal for podcasters.
  8. Speechmatics: Known for its vast datasets and ability to handle different languages and dialects.

Създавайте дублажи, клонинги и гласове с над 1 000 гласа на 100+ езика

Пробвайте безплатно
studio banner faces

Споделете тази статия

Cliff Weitzman

Клиф Вайцман

Главен изпълнителен директор и основател на Speechify

Клиф Вайцман е застъпник за хора с дислексия и е главен изпълнителен директор и основател на Speechify — приложението номер 1 в света за преобразуване на текст в реч, с над 100 000 петзвездни отзива и първо място в App Store в категорията „Новини и списания“. През 2017 г. Вайцман е включен в престижния списък Forbes 30 под 30 за приноса си към това интернет да бъде по-достъпен за хора с обучителни затруднения. Клиф Вайцман е представян в EdSurge, Inc., PC Mag, Entrepreneur, Mashable и много други водещи медии.

speechify logo

За Speechify

#1 четец за текст към реч

Speechify е водещата в света платформа за текст към реч, на която се доверяват над 50 милиона потребители и която има повече от 500 000 петзвездни отзива за своите приложения за текст към реч за iOS, Android, разширение за Chrome, уеб приложение и настолно приложение за Mac. През 2025 година Apple отличи Speechify с престижната Apple Design Award на WWDC, определяйки я като „ключов ресурс, който помага на хората да живеят по-добре“. Speechify предлага над 1000 естествено звучащи гласа на над 60 езика и се използва в близо 200 държави. Сред известните гласове са Snoop Dogg и Гуинет Полтроу. За създатели и бизнеси Speechify Studio предоставя напреднали инструменти, включително AI генератор на гласове, AI клониране на глас, AI дублаж и AI променящ глас. Speechify също задвижва водещи продукти със своето висококачествено и достъпно като цена API за текст към реч. Представено в The Wall Street Journal, CNBC, Forbes, TechCrunch и други водещи медии, Speechify е най-големият доставчик на услуги за текст към реч в света. Посетете speechify.com/news, speechify.com/blog и speechify.com/press, за да научите повече.