Posted by ph4evers 4 days ago
Every video is transcribed to get much better transcripts than the closed captions. I filter for high-quality transcripts, and then an LLM selects only plausible segments for the exercises. This seems to work well for quality control and is reliable enough for these short exercises.
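Roughly, the pipeline looks like this (a minimal sketch only: the function names, the confidence threshold, and the stand-in for the LLM pass are illustrative, not the actual code):

```python
# Hypothetical sketch of the two-stage quality filter described above.
# Thresholds and the "LLM" stand-in are assumptions, not the real system.

def filter_high_quality(transcripts, min_confidence=0.9):
    """Keep only transcripts whose ASR confidence clears a threshold."""
    return [t for t in transcripts if t["confidence"] >= min_confidence]

def llm_selects_plausible(segments):
    """Stand-in for the LLM pass: keep short, self-contained segments.
    The real system would prompt a model to judge each segment."""
    return [s for s in segments if 3 <= len(s["text"].split()) <= 15]

def build_exercises(transcripts):
    good = filter_high_quality(transcripts)
    segments = [seg for t in good for seg in t["segments"]]
    return llm_selects_plausible(segments)

transcripts = [
    {"confidence": 0.95, "segments": [
        {"text": "Le chat dort sur le canapé."},
        {"text": "Euh..."},  # filler, too short to be an exercise
    ]},
    {"confidence": 0.60, "segments": [
        {"text": "Garbled low-confidence transcript."},
    ]},
]

print(build_exercises(transcripts))
```

The point of the two stages is that cheap heuristics (ASR confidence) throw away obviously bad material first, so the more expensive LLM judgment only runs on candidates worth checking.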
Would love your thoughts!
After 4 retries, the spinner finally gave up, but it incorrectly said "Sorry, no exercise available for this language today." instead of what it should have said: "We were unable to load the exercises. Try again later, or contact support at ${email}"
---
The AppSec-er in me wants to point out that returning the version of nginx you're running is an antipattern, since it enables more targeted attacks if that version has known vulnerabilities. It shows up both in the error page and in the headers.
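For reference, the usual first step is to suppress the version string in the nginx config (assuming stock nginx; removing the `Server` header entirely needs an extra module such as headers-more):

```nginx
# nginx.conf (http block): stop advertising the exact version
# in the Server header and on built-in error pages.
http {
    server_tokens off;
}
```

With `server_tokens off;` the server still identifies itself as nginx, but without the version number, in both the `Server` response header and the default error pages.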
Yes, the server got knocked out. I was not expecting this much traffic, hah. I've already upgraded it, but I have an NLP server with 10 language models loaded and it seems to be grinding through CPU.
It would be nice to narrow the YouTube content a bit: not just news, but an option for news in slow French, for example. At least for me, news in slow French is way easier to understand than news in French at 0.5x on YouTube.
Maybe it's just my phone, but the dragging and dropping wasn't hit-or-miss; it was mostly broken. On an English-language video (my native language), filling in three gaps took me about five video repetitions to get the words in place. It made me feel a lot better about my Spanish-speaking performance. Just clicking the words, like someone else suggested, would solve the problem completely for me, but it might be a "hit box" problem on the words.
I've been working hard to get the quality up. And now that I have some paid users for the large languages, I can also auto-transcribe high-quality channels. The main reason for the poor exercises (especially for German) was that I initially picked some poor channels and was being cheap.
I've updated the German channel, and that should hopefully result in a better experience.
I'm using AssemblyAI and Deepgram for the transcripts at the moment. Unfortunately, they don't support Irish. However, I did see this: https://elevenlabs.io/speech-to-text/irish . Not sure how accurate it is.
Also, I'm maybe JLPT N4 and the text was too hard; you should let me choose the difficulty.
However, I was very confused by the interface at first. I started with a 3-gap exercise. I dragged what I thought was the correct word into the gap. Listened again, changed my mind, but I couldn't drag in my new choice. It was a while before I realised that the correct word had been inserted for me, despite me not having completed the other gaps.
It would be better if the answers weren't revealed until the user submits them.