Not saying it’s for sure the case but it might be that the model gets confused by OOD text from the other model whereas it expects its own text to be online from itself (particularly if the CoT is used as context for later conversations).
OMG, this. I was working on a tool to help integrate password managers on macOS and I got completely blocked by the notarizing requirements. Some things literally cannot be built for macOS as open source software, now.
I went to an "elite" school, but I worked at one HFT where the local team of 5 people had only me with a background from a similar school, and another HFT where I was one of three on my immediate team of ~12 engineers. At one point, one of my colleagues at the second firm asked me why I was there as opposed to the alternatives of (1) a company with more of a focus on the elite and cultivating power or (2) being an artist supported by my well-off parents.
Related: I've been pretty frustrated with cursor that you can't call any APIs directly, and it all has to go through their backend. Along with this, they have essentially refused to fix bugs in their custom API key page for months, forcing you to use their "1st party" tokens.
You may be happy with RooCode if you want to use direct APIs if you want (there are tons of supported backend).
The minus of it, is that it is super expensive, unless you have some tricks (you can reuse the LLMs of GitHub Copilot Pro like Claude Anthropic for example, but there is a quota, you can also use Gemini Flash 2.0 Experimental for free at 15 requests per minute, etc.).
Is the history of China any more famine stricken than any other region if the world? In recent history (last two centuries) maybe, but before that? Typically you don't become the worlds most populous country by having frequent famines.
No, a state-based RL research like this is essentially an IK problem. Given a goal position in the world frame, you need to find out the motor configuration to move your EEF to that goal position.
> IK to my knowledge is well known in every setting I am aware of.
Really? When I was working in this field, I've actually never seen anyone who used numerical/analytical IK methods on real robots.
Granted that I was not a Robotics engineer (I was a Deep Learning engineer in the team), but my impression at the time was that no practical IK solution was available for a robot with 8 DOF, let alone a 20 DOF robot like Shadow Hand.
Here’s the simple reason why this is not the case:
If you provide the 6DoF trajectory (+ gripper joints), a lot of robotics (manipulation in particular) is basically solved. The problem is, we don’t have these good trajectories.
Sure, joint space is commonly used for learned policies, but cartesian space isn’t uncommon either.
IK is really just not a major focus on the learning side of robotics because it’s not the problem. The problem is we don’t know what to do even at the slightly higher level.
I tried out uv a bit ago and dropped it. But about two weeks ago, I switched to it and migrated two projects with no issues.
Things like pypi sources per dep are there finally.
I still find rough points (as many others pointed out, especially with non sandboxed installs), that are problematic, but on the whole it’s better than Mamba for my use.
I don’t see why the industry couldn’t move to providing this documentation/full source over a few years.