More

cahaya · 2025-08-20T12:22:40 1755692560

I was also looking for a video. The concept sounds good, but feels like I need to learn a lot of new commands, or have a cheat sheet next to me to be able to be able to use the framework.

aroussi · 2025-08-20T13:12:01 1755695521

Cheatsheet is available via /pm:help

With that being said, a video will be coming very soon.

cahaya · 2025-08-06T14:35:35 1754490935

Wondering how average users can benefit from this platform with Claude Code and the relation to Vending Bench that tracks how much money LLM's can make.

https://andonlabs.com/evals/vending-bench

cahaya · 2025-07-25T13:40:12 1753450812

Nice! Missing a cost calculator with input and output fields.

alexellman · 2025-07-25T13:55:07 1753451707

Can add for the future

cahaya · 2025-07-21T11:19:11 1753096751

Exactly. If Conductor would work with local branches I would switch from Crystal.

cahaya · 2025-07-18T19:14:44 1752866084

I can confirm. When trying convert simple Word sentences and tables to e.g. Markdown/HTML from a Word XML you need a PhD in XML edge cases and nested garbage.

paulbjensen · 2025-07-18T19:41:04 1752867664

I wonder if this tool by MSFT is able to handle that:

https://github.com/microsoft/markitdown

I was amazed when I realised that Word docs were just zip files and you could poke around in the xml files embedded inside of them.

I almost implemented a working React -> Word document renderer back in 2017, but it didn't have support for creating the xml tags with : inside of them (which OOXML documents use).

favorited · 2025-07-18T20:17:22 1752869842

Even though markitdown is a Microsoft project, it's just a thin wrapper around a bunch of 3rd party Python packages. For example, to go from docx to Markdown, it uses mammoth to convert docx to HTML[0], then uses markdownify to convert the HTML into Markdown[1].

[0]https://github.com/microsoft/markitdown/blob/da7bcea527ed04c... [1]https://github.com/microsoft/markitdown/blob/da7bcea527ed04c...

strongpigeon · 2025-07-18T20:08:35 1752869315

Technically, they're a bit more than just zip files (they're OPC containers [0]), but if you're hand editing the file content it doesn't really matter.

[0] Open Package Convention: https://en.wikipedia.org/wiki/Open_Packaging_Conventions

superjan · 2025-07-18T20:03:03 1752868983

Well, it is not pretty to see how the sausage gets made, but extracting formatted text from docx is absolutely doable, no PhD involved. Source: I have done it as a little sidequest because it was useful to audit a set of word documents.

cahaya · 2025-04-28T09:12:53 1745831573

Building a AI-assisted aviation regulation compliance tool for aviation professionals: https://aviation.bot At the moment just a RAG with EU aviation regulations. Soon FAA, UK CAA, and more complex AI agent features that do more complex deep research.

cahaya · 2025-04-15T16:37:16 1744735036

Wondering how this compares to the Gemini (preview) embeddings as they seem to perform significantly better than OpenAI embeddings 3 large. I don't see any MTEB scores so hard to compare.

mahjongmen · 2025-04-15T17:32:31 1744738351

Hey Cahaya,

While we benchmarked internally, on BEIR, we opted not to report our model onto MTEB for the following reason:

1) MTEB has been gamed - if you look at this model (https://huggingface.co/voyageai/voyage-3-m-exp) on the MTEB leaderboard, its an intermediate checkpoint of Voyage-3-Large where they finetuned it on datasets that represent MTEB datasets.

2) If you look at the recent datasets in MMTEB, you'll find that it has quite a lot of machine translated or "weird" datasets that are quite noisy

In general, for our Search Models, we benchmark on these public academic datasets but we definitely do not try to hillclimb in this direction as we find it has little correlation with real use-cases

cahaya · 2025-01-20T03:28:16 1737343696

Recommendations? I'm seeking a robotics course akin to prompt engineering, LangChain, or LangGraph that prepares me for the 'ChatGPT' moment in robotics. While SLAM navigation and similar topics are vital, it feels reminiscent of five years ago when one needed to master machine learning to create their own ML models. Nowadays, most LLM models are API-based and don't require an ML background.

cahaya · 2024-12-10T05:20:01 1733808001

I'm wondering how this Google announcement will have an impact on their future revenue, and how easily competitors can replicate this breakthrough?

cahaya · 2024-11-18T06:23:21 1731911001

Would be cool to run these 3d terminal models as Mac screensaver!