Top
Best
New

Posted by davidgu 3 days ago

Launch HN: Recall.ai (YC W20) – API for meeting recordings and transcripts

Hey HN, we're David and Amanda from Recall.ai (https://www.recall.ai). Today we’re launching our Desktop Recording SDK, a way to get meeting data without a bot in the meeting: https://www.recall.ai/product/desktop-recording-sdk. It’s our biggest release in quite a while so we thought we’d finally do our Launch HN :)

Here’s a demo that shows it producing a transcript from a meeting, followed by examples in code: https://www.youtube.com/watch?v=4croAGGiKTA . API docs are at https://docs.recall.ai/.

Back in W20, our first product was an API that lets you send a bot participant into a meeting. This gives developers access to audio/video streams and other data in the meeting. Today, this API powers most of the meeting recording products on the market.

Recently, meeting recording through a desktop form factor instead of a bot has become popular. Many products like Notion and ChatGPT have added desktop recording functionality, and LLMs have made it easier to work with unstructured transcripts. But it’s actually hard to reliably record meetings at scale with a desktop app, and most developers who want to add recording functionality don’t want to build all this infrastructure.

Doing a basic recording with just the microphone and system audio is fairly straightforward since you can just use the system APIs. But it gets a lot harder when you want to capture speaker names, produce a video recording, get real-time data, or run this in production at large scale:

- Capturing speaker names involves using accessibility APIs to screen-scrape the video conference window to monitor who is speaking at what time. When video conferencing platforms change their UI, we must ship a change immediately, so this keeps working.

- Producing a video recording that is clean, and doesn’t capture the video conferencing platform UI involves detecting the participant tiles, cropping them out, and compositing them together into a clean video recording.

- Because the desktop recording code runs on end-user machines, we need to make it as efficient as possible. This means writing highly platform-optimized code, taking advantage of hardware encoders when available, and spending a lot of time doing profiling and performance testing.

Meeting recording has zero margin for failure because if anything breaks, you lose the data forever. Reliability is especially important, which dramatically increases the amount of engineering effort required.

Our Desktop Recording SDK takes care of all this and lets developers build meeting recording features into their desktop apps, so they can record both video conferences and in-person meetings without a bot.

We built Recall.ai because we experienced this problem ourselves. At our first startup, we built a tool for product managers that included a meeting recording feature. 70% of our engineering time was taken up by just this feature! We ended up starting Recall.ai to solve this instead. Since then, over 2000 companies use us to power their recording features, e.g. Hubspot for sales call recording, Clickup for their AI note taker. Our users are engineering teams building commercial products for financial services, telehealth, incident management, sales, interviewing, and more. We also power internal tooling for large enterprises.

Running this sort of infrastructure has led to unexpected technical challenges! For example, we had to debug a 1 in 36 million segfault in our audio encoder (https://www.recall.ai/blog/debugging-a-1-in-36-000-000-segfa...), we encountered a Postgres lock-up that only occurs when you have tens of thousands of concurrent writers (https://news.ycombinator.com/item?id=44490510), and we saved over $1M a year on AWS by optimizing the way we shuffle data around between our processes (https://news.ycombinator.com/item?id=42067275).

You can try it here: https://www.recall.ai. It's self-serve with $5 of free credits. Pricing starts at $0.70 for every hour of recording, prorated to the second. We offer volume discounts with scale.

All data recorded through Recall.ai is the property of our customers, we support 0-day retention, and we don’t train models on customer data.

We would love your feedback!

97 points | 51 commentspage 2
iddan 3 days ago|
Congrats on the launch! I'm working on a new tool for startups sales (https://closer.so) and in many customer interviews the point of not wanting the bot in the meeting kept coming up. I love how Recall keeps brining frontier tech as APIs
shravankumar8 2 days ago||
Hello are you open to hiring for Remote roles ?? am looking for Full stack junior or intern roles and was truely impressed by the product
giveita 3 days ago||
I have Loom recorded a Zoom meeting so get it. I think for corporates though the integrated approaches are so so convenient. Have your meeting and get a summary email by doing nothing (or one click opt in). I feel like your solution is for edge cases where the mainstream ways are not possible.
davidgu 3 days ago|
Just to clarify, we’re the infra layer that reliably captures and normalizes meeting data across platforms. The real value for users is what developers build on top: automated analysis, enrichment, and workflows (not the capture itself)

Modern LLMs can power sales coaching, medical scribing, legal review, support QA, and compliance reporting but they need consistent inputs to process. We handle capture/formatting/edge cases so developers can focus on models and UX

aldebran 2 days ago||
Related but tangential: is there a way to grab video frames from a meeting? Audio transcript is great. Looking at a use case to grab participant video.
jitl 2 days ago|
> Back in W20, our first product was an API that lets you send a bot participant into a meeting. This gives developers access to audio/video streams and other data in the meeting. Today, this API powers most of the meeting recording products on the market.

> Desktop Recording SDK

s-zeng 2 days ago||
Hi David and Amanda! Followed you guys from the very beginning, glad to see Recall.ai get so big!
paool 2 days ago||
So this wouldn't work in a web app, only a desktop app?

I guess that leaves electron and tauri?

nottorp 2 days ago||
Is it a good idea to stay named like the spyware that MS has put in Windows 11?
moomoo11 2 days ago|
98% of people don’t know. Including me and I am here often.
berz01 3 days ago||
70 cents per hour is a mountain of fees... basically a $1 per meeting. Sheesh.
IMTDb 2 days ago||
The median salary in the US is $29/hour. By definition a one hour meeting has at least two people in it; often more. So two median guys talking for an hour costs ~$60. The meeting the you really want transcripts for often contain more than one person; and often involve people earning more than the median. I'd happily ad $1 to every single one of my meetings if they get more productive.
nduncan_hmc 3 days ago|||
It is a lot but processing real time video and audio streams inherently consumes alot of CPU. So they may not be making as much profit on that price as you'd think.

I run an open source alternative to Recall (for meeting bots), and our costs are about 8 cents per hour.

ada1981 2 days ago||
What is the open source project?
nduncan_hmc 2 days ago||
https://attendee.dev/
davidgu 3 days ago||
$0.70/hr is our starter rate for low-volume testing. In production, developers will see higher usage and choose to commit to volume and longer-term usage. Because of this, we've seen most teams don’t pay the starter price once they scale beyond early pilots
galaxy_gas 3 days ago||
Is this with active speech or you pay in every second of silence too?
davidgu 3 days ago||
Usage includes silent time too as we are still processing the media streams
galaxy_gas 3 days ago||
What is the value in paying this massive cost when Teams, Zoom all support ootb?
davidgu 3 days ago||
Enabling transcription/recordings per platform and remembering to record creates user-dependent setup. Also the host often needs to install apps which adds security friction, and you still have to build/maintain separate implementations for Zoom/Meet/Teams which is often a cost that devs don't want to deal with

Instead, we built a single API that can get the same results without the issues mentioned above so you can focus on building the features your users care about

More comments...