Two questions about transcription challenge

Jaesung · May 16, 2023, 8:31pm

I want to ask two questions regarding speech transcription challenge.

(1) Is it okay to use OpenAI’s whisper model (Introducing Whisper) for our challenge? I know that it’s a publicly available model, but want to make sure.

(2) If so, do you normalise our submission json file before evaluation? Or do I have to submit the json file after normalising with english.glm provided in the github repo?

Thank you!

jachym · May 18, 2023, 11:08am

I am not sure how you use Whisper as part of your solution, but please be sure you comply with Challenge Official Rules https://ego4d-interactive-fig1.s3.eu-west-2.amazonaws.com/tc.pdf especially the part “Intellectual Property”.
Before scoring the submissions, our evaluation script will itself lowercase the words in the transcript and apply the English GLM.

Topic		Replies	Views
Speech transcription challenge submission issues Ego4D Challenges audio-visual	4	348	May 9, 2023
Transcription Challenge leaderboard is not ordered by WER Ego4D Challenges audio-visual	2	278	May 19, 2023
TTM submission failed Ego4D Challenges audio-visual	4	328	March 8, 2023
Could you extend the deadline? Ego4D Challenges	0	277	May 19, 2023
Which challenge do you intend to participate in? Ego4D Challenges	3	513	May 4, 2022

Two questions about transcription challenge

Related topics