Launch HN: Cardboard (YC W26) – Agentic video editor

Posted by sxmawl 5 hours ago

Launch HN: Cardboard (YC W26) – Agentic video editor(www.usecardboard.com)

Hey HN - we're Saksham and Ishan, and we’re building Cardboard (https://www.usecardboard.com). It lets you go from raw footage to an edited video by describing what you want in natural language. There’s a demo video at https://www.usecardboard.com/share/fUN2i9ft8B46, and you can try the product out at https://demo.usecardboard.com (no login required!)

People sit on mountains of raw assets - product walkthroughs, customer interviews, travel videos, screen recordings, changelogs, etc. - that could become testimonials, ads, vlogs, launch videos, etc.

Instead they sit in cloud storage / hard drives because getting to a first cut takes hours of scrubbing through the raw footage manually, arranging clips in correct sequence, syncing music, exporting, uploading to a cloud storage to share, and then getting feedback on WhatsApp/iMessage/Slack, then re-doing the same thing again till everyone is happy.

We grew up together and have been friends for 15 years. Saksham creates content on socials with ~250K views/month and kept hitting the wall where editing took longer than creating. Ishan was producing launch videos for HackerRank's all-hands demo days and spent most of his time on cuts and sequencing rather than storytelling. We both felt that while tools like Premiere Pro and DaVinci are powerful, they have a steep learning curve and involve lots of manual labor.

So we built Cardboard. You tell it to "make a 60s recap from this raw footage" or "cut this into a 20s ad" or "beat-sync this to the music I just added" and it proposes a first draft on the timeline that you can refine further.

We built a custom hardware-accelerated renderer on WebCodecs / WebGL2, there’s no server-side rendering, no plugins, everything runs in your browser (client-side). Video understanding tasks go through a series of Cloud VLMs + traditional ML models, and we use third party foundational models for agent orchestration. We also give a dropdown for this to the end user.

We've shipped 13 releases since November (https://www.usecardboard.com/changelog). The editor handles multi-track timelines with keyframe animations, shot detection, beat sync via percussion detection, voiceover generation, voice cloning, background removal, multilingual captions that are spatially aware of subjects in frame, and Premiere Pro/DaVinci/FCP XML exports so you can move projects into your existing tools if you want.

Where we're headed next: real-time collaboration (video git) to avoid inefficient feedback loops, and eventually a prediction engine that learns your editing patterns and suggests the next low entropy actions - similar to how Cursor's tab completion works, but for timeline actions.

We believe that video creation tools today are stuck where developer tools were in the early 2000s: local-first, zero collaboration with really slow feedback loops.

Here are some videos that we made with Cardboard: - https://www.usecardboard.com/share/YYsstWeWE9KI - https://www.usecardboard.com/share/nyT9oj93sm1e - https://www.usecardboard.com/share/xK9mP2vR7nQ4

We would love to hear your thoughts/feedback.

We'll be in the comments all day :)

76 points | 34 commentspage 2

rd 4 hours ago|

Who do you think your target customer is? Curious to know if you think the money is in short form, traditional YouTube videos, or even movie studios one day.

Great website btw. The onboarding was very pleasing

sxmawl 4 hours ago|

there's value in all the categories you mentioned — we're not focusing on feature filmmakers right now.

target customers usually fall under one of these - marketers / creators / founders

RobotToaster 3 hours ago||

The 10gb file size is going to be limiting for anyone shooting prores or raw.

sxmawl 3 hours ago|

yeah, i agree. we're actively working on bumping that up. it was 5GB last week

for now, an intermediate solution is to splice and upload.

deklesen 4 hours ago||

Nice demo experience!

sxmawl 3 hours ago|

ty!

danieltk76 3 hours ago||

We use Cardboard at Vulnetic and it is an incredible product. The founders are easily accessible, and it has definitely made it easier to film feature update videos. I can't recommend them enough.

sxmawl 3 hours ago|

glad i'm able to help, i really enjoy working with you!

adboio 2 hours ago||

LET'S GOOOOOOO excellent product friends

sxmawl 2 hours ago|

ty ty!

popalchemist 2 hours ago||

Impressive UI. I assume you must be doing some kind of RAG + audio/video transcription on all the media. What's RAG architecture did you go with?

sxmawl 1 hour ago||

we've found more success with similar directions to what claude code took. maybe its closer to hybrid+agentic RAG

newbeeguy 1 hour ago||

    Firefox is not supported ...

But why?

ishandeveloper 44 minutes ago||

Totally fair question. I've actually been a longtime Gecko/Firefox user myself, so this one stings a bit.

The short answer: Firefox doesn't support the File System Access API (https://caniuse.com/?search=File+System+Access+API).

We made a deliberate decision to go client-first. Video editing happens entirely in your browser without us uploading your entire footage on our end. No bandwidth costs for you, no storing your raw video on our servers. The File System Access API is what makes that possible, and unfortunately Firefox just doesn't have it yet.

It's not a forever thing though. For cloud-based projects where files live on our end anyway, Firefox support is very much on the roadmap. But for the local-first editing flow, our hands are a bit tied until Mozilla ships it.

Hope that makes sense, and fingers crossed Firefox adds support soon!

jhatemyjob 2 hours ago||

> We built a custom hardware-accelerated renderer on WebCodecs / WebGL2, there’s no server-side rendering, no plugins, everything runs in your browser (client-side).

Aight imma head out. Holy moly.

sxmawl 2 hours ago|

haha xD

TimCTRL 3 hours ago|

$60...eh

ishandeveloper 36 minutes ago|

Totally fair reaction! Here's our honest thinking behind it.

We deliberately avoided credits/usage-based pricing because as founders using this in our own creative workflow, we hate the cognitive load that comes with it.

If I don't like a voiceover/variation, I should have the freedom to regenerate it until I'm happy without thinking about whether it's "worth" a credit.

That said, we could be wrong! Genuinely curious what you think would feel fair?