Train Scene Graph Generation for Visual Genome and GQA in PyTorch >= 1.2 with improved zero and few-shot generalization [BMVC 2020, ICCV 2021] (Jupyter Notebook; updated Jun 18, 2023)
Decoding Attention is specially optimized for MHA, MQA, GQA, and MLA using CUDA cores for the decoding stage of LLM inference.
Predicting a subgraph alongside the answer in a graph-based VQA model
A vision-language model for solving the GQA (Visual Reasoning in the Real World) dataset.
A RAG-based question-answering system that processes user queries using local documents. It extracts relevant information to answer questions, falling back to a large language model when local sources are insufficient, ensuring accurate and contextual responses.
A 249M-parameter MoE transformer built from scratch in PyTorch: GQA, RoPE, SwiGLU, sparse MoE with 3 auxiliary losses, and an AMP training loop with no Trainer abstractions. The architecture mirrors LLaMA/Mistral/Mixtral design decisions and is fully inspectable.
LaTeX files for my honours thesis: "Graph Attention Networks for Compositional Visual Question Answering"
PyTorch implementation of the Transformer architecture (“Attention Is All You Need”) for English–Italian text-to-text translation, featuring encoder–decoder layers, multi-head attention, and training on a subset of the OPUS Books dataset.
Source code for my honours thesis: "Graph Attention Networks for Compositional Visual Question Answering"
A code deep-dive on one of the key innovations from DeepSeek: Multi-head Latent Attention (MLA)
This repository contains an implementation of Grouped-Query Attention (GQA), an efficient variant of multi-head attention used in modern transformer models like LLaMA.
A toolkit for vision-language processing supporting the growing family of multi-modal transformer-based models
A simple Llama-architecture LLM in PyTorch
Reference Flash Attention implementation in PyTorch with V1/V2, GQA/MQA, Triton kernels, benchmark and docs.
Modern LLM Attention from Scratch — MHA, GQA, MQA, RoPE, and KV-Cache implemented in pure PyTorch.
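Several of the entries above implement grouped-query attention (GQA), in which groups of query heads share a single key/value head, shrinking the KV cache relative to standard multi-head attention. A minimal dependency-free sketch of that head-sharing (the function name, toy dimensions, and inputs are illustrative assumptions, not code from any repository listed here):

```python
import math

def _softmax(xs):
    m = max(xs)
    es = [math.exp(x - m) for x in xs]
    s = sum(es)
    return [e / s for e in es]

def gqa(q_heads, k_heads, v_heads):
    """Grouped-query attention over toy Python lists.

    q_heads: one query vector per query head (n_q heads)
    k_heads, v_heads: per-KV-head lists of key/value vectors (n_kv heads,
        n_kv divides n_q); each group of n_q // n_kv query heads shares
        the same KV head.
    Returns one output vector per query head.
    """
    n_q, n_kv = len(q_heads), len(k_heads)
    group = n_q // n_kv  # query heads per shared KV head
    outs = []
    for h, q in enumerate(q_heads):
        kv = h // group  # index of the KV head shared by this query head's group
        keys, vals = k_heads[kv], v_heads[kv]
        d = len(q)
        # scaled dot-product scores against the shared keys
        scores = _softmax(
            [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in keys]
        )
        # weighted sum of the shared values
        outs.append(
            [sum(w * v[i] for w, v in zip(scores, vals)) for i in range(len(vals[0]))]
        )
    return outs

# 4 query heads sharing 2 KV heads (group size 2)
out = gqa(
    [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0]],
    [[[1.0, 0.0], [0.0, 1.0]], [[0.5, 0.5], [1.0, 1.0]]],
    [[[1.0, 2.0], [3.0, 4.0]], [[5.0, 6.0], [7.0, 8.0]]],
)
```

MQA is the special case with a single KV head (n_kv = 1); MHA is the case n_kv = n_q. The repositories above implement the same idea with batched tensors and fused kernels rather than Python loops.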