Bluesky's Firehouse is known for being an open API, but it is also its flaw as anyone can scrape its data for the likes of AI ...
On 26 November, Daniel van Strien, a machine learning librarian at Hugging Face, uploaded a dataset of 1m public posts and ...
A Hugging Face librarian released and later removed a 1 million Bluesky posts dataset, sparking concerns over data ...
Bluesky user posts and user information was scraped by an AI researcher and built into a dataset and published on open ...
A machine learning librarian scraped public posts and created a searchable AI training dataset, without consent of users.
Although Bluesky itself doesn’t train AI models on user data, it doesn’t prevent others from using its data for training ...
Bluesky, the social media platform often seen as a rival to Twitter, is at the center of a controversy after one million of ...
Bluesky is facing its first major controversy over data scraping after a dataset containing one million public posts appeared ...
President of SCE Worldwide Studios, Shuhei Yoshida, steps down from his 38 years. Scott shares his thoughts on Yoshida’s ...
Daniel van Strien, a machine learning librarian at Hugging Face, took a million Bluesky posts and turned them into a dataset ...
Publishing giants and generative artificial intelligence companies are striking deals that aim to both protect copyright and provide for the rapidly increasing needs of the AI industry.
A quietly growing belief in Silicon Valley holds that breakthroughs from large AI models may be slowing down. Details here.