Posted by david927 7 days ago
Ask HN: What Are You Working On? (Nov 2025)
Thinking about building out my AI memory tool, but I am looking for more hours in a day to do that :)
An Odoo cloud hosting platform, an alternative to odoo.sh with additional functionality (pgAdmin, external S3 backups, etc.) and a back-office portal to create landing pages and pricing plans for your customers.
It's the Microsoft stack, I'm embarrassed
So far I've got the scraping and embeddings / similarity clustering down (to build timelines of news stories), but lots of data cleaning and UI refinement are still required. I find it hard to make choices, so maybe I need a cofounder who can pair up with me. Looking to either monetize news data or build a news analysis / intelligence platform.
(I'm working on basic blog and video aggregators like Planet Python.)
So a paragraph might be fine as a 384-dim vector, but if you have 1,000 words then you might want a 768-dim embedding (if not higher). Embedding models have slightly better or worse accuracy depending on the training data they're fed, but higher dimensionality generally gives better results, at least up to a point. If you have a very long piece of text, it's easier to chunk it into pieces and create a separate embedding for each. You do have to manually stitch them back together and do some cleanup when displaying results, but it works.
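A rough sketch of the chunking step, in case it helps. The function name `chunk_words`, the 200-word window, and the 20-word overlap are all made up for illustration; the right sizes depend on the embedding model you use. Keeping the start offset with each chunk is one way to make the later "stitch them back together" step easier.

```python
def chunk_words(text, max_words=200, overlap=20):
    """Split text into overlapping word windows.

    Returns a list of (start_word_index, chunk_text) pairs. The window
    size and overlap are illustrative defaults, not recommendations.
    """
    if overlap >= max_words:
        raise ValueError("overlap must be smaller than max_words")

    words = text.split()
    step = max_words - overlap
    chunks = []
    for start in range(0, len(words), step):
        window = words[start:start + max_words]
        chunks.append((start, " ".join(window)))
        # Stop once a window has reached the end of the text,
        # so we don't emit a tiny trailing chunk that is pure overlap.
        if start + max_words >= len(words):
            break
    return chunks
```

Each chunk then gets its own embedding, and the stored start index tells you where it came from when you display results.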
Once you have embeddings for all your data, the rest is just cosine similarity; play around with the min_similarity threshold. You will need to build good indexes in Postgres, but that is basically all you need.
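For anyone who wants to see the similarity step spelled out, here is a minimal in-memory sketch. It assumes the embeddings are plain lists of floats keyed by some ID; `similar_items` and the 0.8 default for `min_similarity` are hypothetical, and a real setup would push this query into the database rather than loop in Python.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat a zero vector as similar to nothing
    return dot / (norm_a * norm_b)


def similar_items(query, items, min_similarity=0.8):
    """Return (key, score) pairs scoring at least min_similarity, best first.

    `items` maps an ID to its embedding vector. The threshold is the knob
    to tune: lower it for looser clusters, raise it for tighter ones.
    """
    scored = ((key, cosine_similarity(query, vec)) for key, vec in items.items())
    kept = [(key, score) for key, score in scored if score >= min_similarity]
    return sorted(kept, key=lambda pair: pair[1], reverse=True)
```

With toy 2-dim vectors, `similar_items([1, 0], {"a": [1, 0], "b": [0, 1], "c": [1, 1]}, min_similarity=0.7)` keeps "a" and "c" and drops "b", since "b" is orthogonal to the query.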