Aditu - Elhuyar speech recognition - Frequently Asked Questions

What is the success rate of Aditu transcriptions?

The success rate depends on several factors: the quality of the audio recording, echo, whether there is noise or music in the background, the type of speech, the register, whether the language is standard or a dialect, the volume, the speed... Under optimal conditions, the success rate can exceed 95%.
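To make a figure such as 95% concrete, here is a minimal Python sketch of one common way to score a transcription: word accuracy, that is, 1 minus the word error rate against a manually corrected reference. This is an assumption for illustration only; the FAQ does not state how Aditu's success rate is measured, and the function names below are hypothetical.

```python
def word_errors(reference: str, hypothesis: str) -> int:
    """Minimum number of word substitutions, insertions and deletions
    needed to turn the hypothesis into the reference (Levenshtein distance
    computed over words rather than characters)."""
    ref = reference.split()
    hyp = hypothesis.split()
    # prev[j] = edit distance between the first i-1 reference words
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # word missing from hypothesis
                            curr[j - 1] + 1,      # extra word in hypothesis
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def word_accuracy(reference: str, hypothesis: str) -> float:
    """Word accuracy = 1 - (word errors / number of reference words).
    Assumed metric, not necessarily the one Aditu reports."""
    ref_len = len(reference.split())
    if ref_len == 0:
        raise ValueError("reference must contain at least one word")
    return 1.0 - word_errors(reference, hypothesis) / ref_len
```

Under this assumed metric, a 95% rate would mean roughly one wrong, missing or extra word for every twenty words spoken.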

How can I get the best results for my transcription?

To get the best transcription results, try to make sure your recording has the following characteristics:

A good microphone (directional, eliminating ambient sound, using earphones...)
No echo or noise
No background music (or background music on a separate track)
Correct and complete sentences, with no repetitions, corrections or hesitations
Formal register
Standard language
A single speaker
Adequate volume, neither too loud nor too soft
Proper speed, neither too fast nor too slow

If I have the voice on one track and the music on another, how will Aditu know that it only has to use one?

When you upload the file, you can indicate which track contains the voice, if it is on a separate track.

What files or inputs does Aditu accept?

It accepts the most common audio and video formats, such as WAV, MP3, MP4, MOV, WEBM, etc. You can also record directly from the website using the microphone on your computer or phone, without having to prepare a file.

What languages does Aditu recognise?

For the time being, Aditu recognises Basque, Spanish, English, French, Catalan and Galician, as well as bilingual speech (for example, municipal plenary sessions).

How much time will I need to get the transcription?

In the case of uploaded files, it will depend on the server load, but it will normally take much less time than the duration of the file.

In what format does Aditu return the transcription? Unformatted text, subtitles, etc.?

Aditu will provide you with:

The transcription in unformatted text.
A subtitle file (i.e., the transcription and the time stamps for each sentence or phrase, so that conventional audio or video players or web browsers can understand and display them) in different formats: SRT, VTT, etc.
Transcription with time stamps for each word (for example, for advanced searches in video).

Numbers, times, dates, abbreviations, acronyms, upper and lower case, punctuation marks, etc. are transcribed correctly in all of these outputs.
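As an illustration of how the word-level time stamps relate to a subtitle file, the following Python sketch groups timed words into SRT cues. The input shape (a list of `(word, start_seconds, end_seconds)` tuples), the function names and the fixed words-per-cue rule are assumptions made for this example; they do not describe Aditu's actual export format or its segmentation into sentences or phrases.

```python
def format_srt_time(seconds: float) -> str:
    """Render a time in seconds as an SRT time stamp, HH:MM:SS,mmm."""
    total_ms = round(seconds * 1000)
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, millis = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def words_to_srt(words, max_words=7):
    """Build SRT text from (word, start_seconds, end_seconds) tuples.

    Hypothetical input shape. Words are simply grouped max_words at a
    time; a real subtitler would split on sentence or phrase boundaries.
    """
    cues = []
    for index in range(0, len(words), max_words):
        group = words[index:index + max_words]
        start = format_srt_time(group[0][1])   # start of the first word
        end = format_srt_time(group[-1][2])    # end of the last word
        text = " ".join(word for word, _, _ in group)
        # An SRT cue is: sequence number, time range, text, blank line.
        cues.append(f"{len(cues) + 1}\n{start} --> {end}\n{text}\n")
    return "\n".join(cues)


if __name__ == "__main__":
    sample = [("Kaixo", 0.0, 0.4), ("mundua", 0.5, 1.0)]
    print(words_to_srt(sample))
```

SRT writes milliseconds after a comma, whereas VTT uses a full stop and begins the file with a `WEBVTT` header line, so the same grouping logic could feed either format.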

Can it translate the transcriptions and subtitles?

What time credit will I have for the tests?

How can I buy more time?

Can I use the Aditu transcriber from my working tool instead of using the website?