Deep search
Search
Copilot
Images
Videos
Maps
News
Shopping
More
Flights
Travel
Hotels
Real Estate
Notebook
Top stories
Sports
NFL
U.S.
2024 Election
Local
World
Science
Technology
AI
Entertainment
Business
More
Politics
Any time
Past hour
Past 24 hours
Past 7 days
Past 30 days
Best match
Most recent
Bluesky, Hugging Face
Bluesky dataset for AI training removed from Hugging Face
On 26 November, Daniel van Strien, a machine learning librarian at Hugging Face, uploaded a dataset of 1m public posts and accompanying metadata taken from Bluesky’s firehose API. The dataset card explained it was “intended for machine learning research and experimentation with social media data”.
One million public Bluesky posts scraped for AI training
Bluesky is already facing its first major AI scrape, despite the stance of its owners that it will never train generative AI on user data.
Hugging Face’s Dataset Release Exposes 1M Bluesky Posts for Research
A Hugging Face librarian released and later removed a 1 million Bluesky posts dataset, sparking concerns over data transparency and consent. Daniel van
Bluesky’s open API means anyone can scrape your data for AI training
Bluesky might not be training AI systems on user content as other social networks are doing, but there’s little stopping third-parties from doing so. Per a report by 404 Media, a machine learning librarian at AI firm Hugging Face pulled 1 million public posts from Bluesky via its Firehose API for machine learning research,
Twitter rival Bluesky’s user posts scraped for training AI
Bluesky user posts and user information was scraped by an AI researcher and built into a dataset and published on open platform, Hugging Face.
2d
Hugging Face’s SmolVLM could cut AI costs for businesses by a huge margin
Hugging Face launches SmolVLM, a compact and efficient vision-language AI model, offering businesses a cost-effective ...
3d
on MSN
OpenAI's Sora Leaked Online Over 'Unpaid Labor,' Artists Say
OpenAI's text-to-video generator, Sora, was leaked online by digital artists protesting their use as beta testers for what ...
13h
on MSN
Dell expands AI capabilities with new PowerEdge servers using high-performance AMD processors
Dell's new PowerEdge servers look to accelerate deployment timelines and optimize performance for AI applications globally.
techzine
2d
Hugging Face model SmolVLM requires a lot less compute
SmolVLM is a model that can process visual input and generate textual output. It distinguishes itself by requiring ...
2d
Sora 2.0 Leak: How OpenAI’s Video Model Sparked Contributor Concerns
The Sora 2.0 leak highlights OpenAI’s AI advancements and raises questions about transparency, unpaid labor, and ethical ...
3d
Artists Stick It to ‘AI Overlords’ and Leak OpenAI’s Sora Video Generator
A group of artists who say they were given early access to OpenAI's Sora video generation model released a version of the ...
4d
on MSN
OpenAI’s Sora video generator appears to have leaked
A group of artists appears to have leaked access to Sora, OpenAI's video generator, in protest of what it's calling duplicity ...
3d
OpenAI’s Sora Tool Leaked By Group Of Aggrieved Early Testers
At the heart of the controversy is a multifaceted conflict involving technological advancement, ethical concerns and artistic ...
Results that may be inaccessible to you are currently showing.
Hide inaccessible results
Related topics
OpenAI
Sora
Bluesky
Artificial intelligence
SmolVLM
Feedback