Hacker News new | past | comments | ask | show | jobs | submit login

>objectively a huge difference in political plurality in US training material

Under that condition, then objectively US training material would be inferior to PRC training material since it is (was) much easier to scrape US web than PRC web (due to various proprietary portal setups). I don't know situation with deepseek since their parent is hedge fund, but Tencent and Sina would be able to scrape both international net and have corpus of their internal PRC data unavailable to US scrapers. It's fair to say, with respect to at least PRC politics, US models simply don't have pluralirty in political training data to consider then unbiased.




So you argument is that Chinese AI companies are less biased because they have access to tightly controlled Chinese internet data?

Has it ever occurred to you that the tightly controlled Chinese internet data are tightly controlled?

Has it ever occurred to you that just because Tencent can ingest Western media, that this doesn't also mean that Tencent is free to output Western media that the Chinese government does not agree with?

Please go back to school and study harder, you have disappointed me. EMOTIONAL DAMAGE.


The argument is PRC models can use data corpus from both sides of the great fire wall, whereas US models can't, hence US models technically incapable of being unbiased, whereas PRC at least could be.




Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: