BBC News Dataset – February 2023 Edition
bbc.com · CSV
Get access to a comprehensive and structured dataset of BBC News articles, freshly crawled and compiled in February 2023. This collection includes 1 million records from one of the world’s most trusted news organizations — perfect for training NLP models, sentiment analysis, and trend detection across global topics.
💾 Format: CSV (available in ZIP archive)
📢 Status: Published and available for immediate access
Use Cases-
Train language models to summarize or categorize news
-
Detect media bias and compare narrative framing
-
Conduct research in journalism, politics, and public sentiment
-
Enrich news aggregation platforms with clean metadata
-
Analyze content distribution across categories (e.g. health, politics, tech)
This dataset ensures reliable and high-quality information sourced from a globally respected outlet. The format is optimized for quick ingestion into your pipelines — with clean text, timestamps, image links, and more.
Need a filtered dataset or want this refreshed for a later date? We offer on-demand news scraping as well.
👉 Request access or sample now
Fields
title, url, published_at, author, publisher, short_description, header_image, category, raw_description, description, uniq_id, scraped_at