Agentic_News - https://agenticnews.vercel.app/
This project is an AI agentic news gather,summarisation and publishing website that scrapes news articles, summarizes them using Facebook's BART model, and displays them in a user-friendly UI built with Next.js. The backend scrapes data from news websites and stores it in MongoDB, ensuring continuous updates with automated scraping cycles.
Video - (https://youtu.be/xuS2gxCtG6g?si=S-9smPQcXEXMMQ6N)
- AI-powered Summarization: Uses the BART model to generate concise news summaries.
- MongoDB Integration: Stores news data, ensuring no duplicates.
- Automated Scraping: Fetches news at regular intervals.
- Next.js Frontend: Optimized for SEO with real-time updates.
- City-based Filtering (Upcoming Feature): Users can filter news by city.
git clone https://github.com/PratikChrs/Agentic_News.git
cd frontendnpm installCreate a .env.local file in the frontend directory and add:
MONGO_URI=your_mongodb_connection_stringnpm run dev- Uses Facebook's BART model for summarizing news articles.
- The model is downloaded once and stored locally to avoid redundant downloads.
The database stores news data with these fields:
- id (Unique identifier)
- date (Publication date)
- heading (News headline)
- image URL (Thumbnail image)
- article URL (Source link)
- summary (BART-generated summary in an array format) it prevents duplication by checking existing articles before storing new ones.
- Scrapes news from Times of India (expandable to other sources).
- Uses BeautifulSoup to extract relevant data from HTML.
- Filters news based on date to fetch only the latest articles.
- Location-based filtering is hardcoded to Ranchi but can be expanded.
- Extracts date, key points,images and links from articles.
- The BART model condenses each article into 4-5 key lines.
- The summarized content is stored in MongoDB.
When executed, the scraper:
- Connects to Hugging Face to load the BART model.
- Connects to MongoDB to check for existing articles.
- Fetches only new articles (skipping stored ones).
- Stores newly found articles in MongoDB.
- Automatically Runs at fixed intervals to keep data fresh.
- Dynamically fetches news articles from MongoDB.
- Displays articles in descending order (latest first).
- Each news card contains:
- Headline
- Summarized content
- Date and Image
- Auto-refreshes every 30 seconds to fetch new articles.
- UI updates automatically when new data enters MongoDB.
- If an article is deleted, it disappears on the next refresh.
- SEO Score: 98 on https://pagespeed.web.dev/ (optimized for search engines).
- Next.js improves performance for better rankings.
- The scraper runs automatically
- If an article is deleted, the scraper re-fetches it in the next cycle.
- Ensures the database and frontend stay updated.


.png)