Posts

Showing posts from December, 2020

Al Jazeera English News Dataset

Explore a structured sample of English-language news articles published by Al Jazeera, extracted and formatted for easy analysis and use in NLP, content modeling, or media studies.

📊 Dataset Overview:
Total Records: 3,400+ news articles
Fields Included (16): Title, Author, URL, Content, Raw Content, Header Image, Images, Language, Website, Post Date, Category, Sub Category, and more.

📦 Use Cases:
🧠 Text classification and summarization training
📰 Headline generation or news clustering
🌍 Language modeling and bias analysis in journalism
🔎 Building news monitoring dashboards

📥 Download:
👉 Download Demo Dataset (Free)

Want full access to 150,000+ recent news articles from Al Jazeera, CNN, NDTV, India Today, and more?
📩 Browse our full News Datasets → or Contact us for bulk or custom crawls.
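
🐍 Quick start (sketch):
For a first look at the data, a short Python sketch like the one below can count articles per category. It assumes the demo download is a CSV file whose header row uses the field names listed above (for example "Category"); the actual file format and column spelling may differ, so adjust accordingly. The function name count_by_category and the sample rows are made up for illustration and are not real dataset records.

```python
import csv
import io
from collections import Counter

# Assumption: the column is named exactly as in the field list above.
CATEGORY_FIELD = "Category"


def count_by_category(csv_file):
    """Count articles per category from an open CSV file object."""
    reader = csv.DictReader(csv_file)
    counts = Counter()
    for row in reader:
        # Rows with a missing or empty category are grouped under "(none)".
        category = (row.get(CATEGORY_FIELD) or "").strip() or "(none)"
        counts[category] += 1
    return counts


# Placeholder rows for illustration only, not real dataset records.
sample = io.StringIO(
    "Title,Category,Language\n"
    "Example headline A,News,en\n"
    "Example headline B,Sports,en\n"
    "Example headline C,News,en\n"
    "Example headline D,,en\n"
)
counts = count_by_category(sample)
for category, n in counts.most_common():
    print(category, n)
```

With the real download, the in-memory sample would be replaced by something like open("al_jazeera_sample.csv", newline="", encoding="utf-8"), where the filename is a hypothetical placeholder.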