About Mira

Multi-Agent AI System for Company Research

About Mira

Company information lives in many places: websites, LinkedIn, news, press releases, job postings, social media, review sites. Getting the full picture means checking multiple sources and deciding what to trust.

Mira does this automatically. It's a multi-agent AI system that gathers information from company websites, LinkedIn profiles, and Google Search, then assembles a structured profile with confidence scores and clear source attribution.

The system features smart early termination - once all configured data points reach high confidence scores, it automatically stops processing to save time and API costs. Sources are fully configurable, allowing you to enable or disable website crawling, LinkedIn analysis, and Google Search based on your needs.

The core of Mira is a framework-agnostic library that can be published as an npm package or integrated directly into your applications, pipelines, or custom workflows.

To show how it works in practice, this repository includes a complete Next.js frontend application that consumes the core library and provides a full interface with workspace management for running research and viewing results. For processing large datasets, a bulk processing tool is also included.

Mira in action

Key Features

Data Collection

Configurable Data Points – Define exactly what information to collect (company name, industry, funding, etc.) with custom descriptions for precise extraction.
Intelligent Source Selection – Enable/disable website crawling, LinkedIn analysis, and Google Search based on your needs.
Service Layer for Data Gathering – Built-in services handle scraping, Google Search, LinkedIn company data, and cookie consent banners.

Intelligence & Orchestration

Multi-Agent Architecture – Specialized agents handle discovery, internal pages, LinkedIn, Google Search, and analysis, with intelligent orchestration.
Smart Early Termination – Automatically stops processing when all data points reach high confidence scores, saving time and API costs.
Confidence Scoring & Source Attribution – Each fact includes a confidence score (1-5) and references its source for transparency and trust.

Analysis & Output

Company Analysis & Criteria Matching – Generate executive summaries and evaluate companies against custom criteria with fit scores (0-10) and detailed reasoning.
Personalized Outreach Generation – AI-powered LinkedIn and email outreach message creation based on enriched company data with customizable prompts.

Developer Experience

Realtime Progress Events – Emits structured events during execution so you can track and display live progress.
Composable Core Library – Framework-agnostic and publishable as an npm package, ready for Node.js/TypeScript projects.
Complete Next.js Frontend – Full application showing how to consume the library with workspace management, user authentication, and live progress updates.
Bulk Processing – Process large datasets with resume capability, concurrent processing, and automatic result export.

How it works

Mira takes a company's website URL and your configuration, then intelligently orchestrates multiple AI agents to gather comprehensive company information. You can customize exactly what data to collect and which sources to use.

Configuration

Data Points: Define custom data points with names and descriptions (e.g., "industry": "Primary business sector or market vertical")
Sources: Enable/disable website crawling, LinkedIn analysis, and Google Search (landing page is always analyzed)
Analysis: Optionally enable executive summary generation and/or provide company criteria for fit scoring

Intelligent Orchestration

Discovery agent analyzes the landing page, extracts social profiles, and identifies relevant internal pages
Internal pages agent (if enabled) scans discovered pages for data points that need improvement
LinkedIn agent (if enabled) gathers additional details, but only for missing or low-confidence data points
Google Search agent (if enabled) queries for remaining gaps using targeted searches
Company analysis agent (if configured) generates executive summary and/or evaluates company criteria fit

Smart Early Termination

The system continuously monitors data point confidence scores. If all configured data points reach the minimum confidence threshold, processing automatically terminates early to save time and API costs.

Data Merging & Confidence

Every data point includes a confidence score (1-5) and source attribution
When multiple sources provide the same information, higher confidence scores take precedence
Real-time progress events are emitted throughout execution for live status tracking

Architecture Diagram

Tech Stack

Core Library (`packages/mira-ai`)

Node.js – runtime environment.
TypeScript – type safety and maintainability.
OpenAI Agents SDK – multi-agent orchestration and reasoning.
ScrapingBee – API-based scraping, used for both website crawling and Google Search.
Zod – runtime schema validation and input/output type enforcement.
Jest – testing framework for validating services and agents individually.

Frontend Application (`apps/mira-frontend`)

Next.js – full-featured interface to run enrichment and display results.
Supabase – user authentication and workspace storage.
Workspace Management – create and manage multiple research configurations with custom data points, sources, and analysis settings.
TypeScript – Consumes core library types.
TailwindCSS – styling for the UI.
shadcn/ui – accessible, prebuilt UI components.

Bulk Processing (`apps/mira-bulk`)

Node.js – standalone CLI for processing large datasets.
SQLite – local progress tracking with resume capability.
PQueue – concurrent processing with configurable parallelism.

Requirements

Node.js v18 or later (ensures compatibility with the OpenAI Agents SDK)
npm (comes with Node.js) or pnpm/yarn as your package manager
API Keys:
- OPENAI_API_KEY — for agent orchestration
- SCRAPING_BEE_API_KEY — for web scraping and Google Search
Supabase Account (for frontend) — user authentication and workspace storage

Environment Variables

Mira requires API keys to function. Environment files are used to separate configuration for local development and testing.

Core Library Testing (`packages/mira-ai/.env.test`)

For testing the core library, create a .env.test file:

OPENAI_API_KEY=sk-xxxx
SCRAPING_BEE_API_KEY=xxxx

Frontend (`apps/mira-frontend/.env.local`)

For running the frontend, create a .env.local file with additional Supabase configuration:

OPENAI_API_KEY=sk-xxxx
SCRAPING_BEE_API_KEY=xxxx

NEXT_PUBLIC_SUPABASE_URL=your-supabase-url
NEXT_PUBLIC_SUPABASE_ANON_KEY=your-supabase-anon-key

Installation & Setup

You can use Mira in three ways:

Local Development (run the frontend application with workspaces and the core library)
As an npm Package (use the mira-ai library directly in your own project)
Bulk Processing (process large datasets via CLI)

1. Local Development

Clone the repository and install dependencies:

git clone https://github.com/dimimikadze/mira.git
cd mira
npm install

Frontend Setup

Create apps/mira-frontend/.env.local with your API keys and Supabase configuration:

OPENAI_API_KEY=sk-xxxx
SCRAPING_BEE_API_KEY=xxxx

# Supabase
NEXT_PUBLIC_SUPABASE_URL=your-supabase-url
NEXT_PUBLIC_SUPABASE_ANON_KEY=your-supabase-anon-key

Run database migrations:

npm run db:migrate

Generate TypeScript types from your Supabase schema:

npm run generate-types

Start the frontend application:

npm run dev:mira-frontend

2. Use as an npm package (mira-ai library)

npm install mira-ai

import { researchCompany } from 'mira-ai';

const config = {
  apiKeys: {
    openaiApiKey: process.env.OPENAI_API_KEY!,
    scrapingBeeApiKey: process.env.SCRAPING_BEE_API_KEY!,
  },
};

const result = await researchCompany('https://company.com', config, {
  enrichmentConfig: {
    // Define custom data points to collect
    dataPoints: [
      { name: 'industry', description: 'Primary business sector' },
      { name: 'employeeCount', description: 'Number of employees' },
      { name: 'funding', description: 'Latest funding round and amount' },
      { name: 'recentNews', description: 'Recent company news or updates' },
    ],
    // Configure which sources to use (landing page is always analyzed)
    sources: {
      crawl: true, // Enable internal pages crawling
      linkedin: true, // Enable LinkedIn analysis
      google: true, // Enable Google Search
    },
    // Configure analysis options
    analysis: {
      executiveSummary: true, // Generate executive summary
      companyCriteria: 'B2B SaaS companies with 50-200 employees', // Evaluate fit against criteria
    },
  },
  onProgress: (type, message) => {
    console.log(`${type}: ${message}`);
  },
});

console.log(result.enrichedCompany);
console.log(result.companyAnalysis);

3. Bulk Processing

For processing large datasets, use the bulk processing tool. It reads from a CSV, processes companies with configurable concurrency, saves progress to SQLite for resume capability, and exports results to Supabase storage.

See Mira Bulk README for setup and deployment instructions.

Authentication

The frontend application uses Supabase for user authentication and workspace management. Users can sign up and sign in through the Supabase Auth system, with each user having access to their own private workspaces.

Additional Documentation

This monorepo contains three main packages, each with its own README:

Mira AI Library — Node.js/TypeScript library with agents, services, and orchestration logic.
Mira Frontend — Next.js application with workspace management for running enrichment and visualizing results.
Mira Bulk — CLI tool for bulk company enrichment with resume capability.

AI-Assisted Development

If you're developing with AI tools like Cursor, configuration rules are already set up in the root, library, and frontend packages to ensure consistency.

Contributing

See CONTRIBUTING.md for guidelines.

License

Distributed under the MIT License. See LICENSE for details.

Credits

Logo and UI design by salomeskv

Name		Name	Last commit message	Last commit date
Latest commit History 76 Commits
.cursor		.cursor
.github/workflows		.github/workflows
apps		apps
packages/mira-ai		packages/mira-ai
supabase		supabase
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
Diagram.png		Diagram.png
LICENCE		LICENCE
README.md		README.md
Screenshot.png		Screenshot.png
mira.code-workspace		mira.code-workspace
package-lock.json		package-lock.json
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About Mira

Mira in action

Key Features

Data Collection

Intelligence & Orchestration

Analysis & Output

Developer Experience

How it works

Architecture Diagram

Tech Stack

Core Library (`packages/mira-ai`)

Frontend Application (`apps/mira-frontend`)

Bulk Processing (`apps/mira-bulk`)

Requirements

Environment Variables

Core Library Testing (`packages/mira-ai/.env.test`)

Frontend (`apps/mira-frontend/.env.local`)

Installation & Setup

1. Local Development

Frontend Setup

2. Use as an npm package (mira-ai library)

3. Bulk Processing

Authentication

Additional Documentation

AI-Assisted Development

Contributing

License

Credits

About

Uh oh!

Releases 1

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

About Mira

Mira in action

Key Features

Data Collection

Intelligence & Orchestration

Analysis & Output

Developer Experience

How it works

Architecture Diagram

Tech Stack

Core Library (packages/mira-ai)

Frontend Application (apps/mira-frontend)

Bulk Processing (apps/mira-bulk)

Requirements

Environment Variables

Core Library Testing (packages/mira-ai/.env.test)

Frontend (apps/mira-frontend/.env.local)

Installation & Setup

1. Local Development

Frontend Setup

2. Use as an npm package (mira-ai library)

3. Bulk Processing

Authentication

Additional Documentation

AI-Assisted Development

Contributing

License

Credits

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 1

Contributors

Uh oh!

Languages

Core Library (`packages/mira-ai`)

Frontend Application (`apps/mira-frontend`)

Bulk Processing (`apps/mira-bulk`)

Core Library Testing (`packages/mira-ai/.env.test`)

Frontend (`apps/mira-frontend/.env.local`)