Skip to content

pritpatel2412/ARIA

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

2 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

πŸ•΅οΈ ARIA v2.0

Autonomous Vision-Enabled Agentic Platform for Web Research & Competitive Intelligence

Decompose complex queries into parallel browser agents. Get real answers, not hallucinations.


What is ARIA?

ARIA is not a chatbot. It's an autonomous multi-agent system that browses the real web in parallel to answer complex research and competitive intelligence queries.

When you give ARIA a goal, it:

  1. Decomposes your query into 1–6 parallel sub-tasks using Groq LLaMA 3.3
  2. Spawns real browser agents β€” each navigating live websites simultaneously
  3. Extracts structured data from pricing pages, job boards, news sites, financial dashboards
  4. Synthesises all results via Gemini 1.5 Pro into a coherent report
  5. Saves everything to PostgreSQL β€” full session history, resume profiles, watchlists

The key differentiator: real browsers, real pages, real-time data β€” no hallucination, no stale training data.


8 Agent Modes

Mode What It Does
πŸ” Research Analyst Multi-source deep research with source citations
πŸ’Ό Career Copilot Resume parsing + live job search + cover letter generation
πŸ’° Money Agent Real-time stock prices, analyst ratings, ETF comparisons
⚑ Form Executor Autonomously fills and submits real web forms
πŸ€– Assistant General-purpose browser-powered AI assistant
πŸ“ Content Agent Research-backed content generation with live sources
🎯 Job Autopilot Discovers and applies to jobs automatically
πŸ‘οΈ Competitor Watch Monitors competitor pricing and product changes

Key Technical Highlights

🦾 Swarm Architecture

  • Decentralized multi-agent design β€” agents run in parallel, not sequence
  • Each agent has its own browser session, task scope, and result stream
  • SSE-based real-time streaming β€” watch agents work live

πŸ‘οΈ Visual Delta Analysis

  • Multimodal AI detects semantic changes in websites via screenshot comparison
  • Bounding box localization of changed elements
  • Used for competitor monitoring and content change detection

πŸ‘€ Ghost Executive

  • Autonomous career agent that parses resumes into structured profiles
  • Performs intelligent job discovery and form automation
  • Supports 11 Indian languages via Sarvam AI (voice input + TTS output)

🌐 Stealth Browser Automation

  • Real-time browser automation with stealth profiles
  • Reliable navigation across dynamic and JS-heavy sites
  • Handles anti-bot protection via profile rotation

Tech Stack

Layer Technology
Frontend Next.js 15, TypeScript, Tailwind CSS
LLM (Planning) Groq LLaMA 3.3 70B (ultra-fast inference)
LLM (Vision/OCR) Gemini 1.5 Pro
Browser Automation TinyFish real browser agents + SSE streaming
Voice Sarvam AI (11 Indian languages, TTS + STT)
Database Supabase PostgreSQL
Auth Supabase Auth

Architecture

User Goal (text or voice)
        β”‚
        β–Ό
  Groq LLaMA 3.3 (Intent Parsing + Task Decomposition)
        β”‚
        β–Ό
  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
  β”‚      Agent Swarm (parallel)      β”‚
  β”‚  Agent 1 β”‚ Agent 2 β”‚ Agent 3    β”‚
  β”‚  Browser β”‚ Browser β”‚ Browser    β”‚
  β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
        β”‚
        β–Ό
  Gemini 1.5 Pro (Synthesis + Vision)
        β”‚
        β–Ό
  Structured Report + Voice Output (Sarvam AI)
        β”‚
        β–Ό
  Supabase PostgreSQL (Session History)

Getting Started

git clone https://github.com/pritpatel2412/ARIA
cd ARIA
pnpm install
cp .env.example .env  # Add your API keys
pnpm dev

Required env vars: GROQ_API_KEY, GEMINI_API_KEY, SUPABASE_URL, SUPABASE_SERVICE_ROLE_KEY, SARVAM_API_KEY


Built By

Prit Patel β€” B.Tech CSE @ CHARUSAT University GitHub Β· LinkedIn

About

πŸ•΅οΈ Autonomous multi-agent web research platform β€” parallel browser agents, swarm architecture. Groq LLaMA 3.3, Gemini 1.5 Pro, SSE streaming.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages