Organizations

@Agentic-Systems-Lab @Data-Science-Club-ELTE

haseebraza715/README.md

building agents. then figuring out where they break.

cs @ elte · ai engineering intern @ infineon · research assistant @ eth zurich · founder, elte ds&ai club (100+ members)


what i'm building

| project | what it does |
| --- | --- |
| agent-autopsy | terminal forensics for agent traces: failure loops, root causes, structured fixes |
| AuditLens | fairness auditing for ML datasets: statistical tests + LLM guidance |
| multi-agent-reflection-engine | 5-agent reflection system with eval pipeline and audit snapshots |
| QualModel | turns survey responses and interview transcripts into structured research models |
| obsidian-ai-agent | AI agent for Obsidian, with answers grounded in your own notes |

research  ·  papers in prep

| project | what it does | status |
| --- | --- | --- |
| failure-aware-ocr-rag | OCR + RAG pipeline with typed failure handling and recovery routing | active |
| llm-bias-evaluation | persona-based bias eval across LLMs: 15,552 personas @ elte rc2s2 | paper in prep |
| EngageMind | RAG study assistant with chat memory and GPT-2 LoRA, BSc thesis @ elte | almost :) |

research without deployment is theory. deployment without evaluation is guesswork.

Pinned

  1. EngageMind (Public)

    BSc thesis project: a full-stack AI study assistant that grounds answers in user documents and supports per-user GPT-2 LoRA adaptation.

    JavaScript

  2. acom-embedding-grid (Public)

    ELTE research on ACOM-based discrete semantic mapping of document embeddings, benchmarked against PCA, t-SNE, and UMAP.

    TeX

  3. llm-bias-evaluation (Public)

    ELTE RC2S2 research pipeline for persona-based attitudinal framing bias evaluation across LLMs.

    HTML 1

  4. multi-agent-reflection-engine (Public)

    Production-grade multi-agent reflection engine with evaluation pipeline, audit snapshots, and model-routed agent orchestration.

    TypeScript

  5. agent-autopsy (Public)

    Agentic trace analyzer that detects failure loops, root causes, and structured remediation paths.

    Python 1

  6. obsidian-ai-agent (Public)

    AI agent for Obsidian that reasons over personal notes with grounded, context-aware responses.

    Python