From b1dd894768b51cf475fe4d5615c970d393401e2b Mon Sep 17 00:00:00 2001
From: shivasurya <s.shivasurya@gmail.com>
Date: Sat, 2 May 2026 10:46:41 -0400
Subject: [PATCH] =?UTF-8?q?feat(graph):=20C++=20parser=20=E2=80=94=20class?=
 =?UTF-8?q?es,=20namespaces,=20templates,=20exception=20flow?=
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

Add the C++ AST → graph.Node converter. Builds on the C parser (parser_c.go)
and the shared graph/clike helpers; together they give a .cpp project a
fully populated CodeGraph with Language="cpp" on every node.

# parser_cpp.go (new, ~890 lines)

C and C++ live in separate files. Where the AST shape genuinely differs
(classes, namespaces, templates, throw/try, methods inside class bodies),
parser_cpp.go has its own dedicated parse functions. Where the AST shape
is identical (variable declarations, #include directives), the existing
parseCLikeDeclaration / parseCLikeInclude in parser_c.go are reused via
their isCpp flag — no duplication. Where the shape is similar but the
language tag differs (struct/enum/typedef/call), parser_cpp.go has its
own thin parse functions to keep semantics-by-language explicit.

C++-specific node types added:
- class_declaration       (Name, SuperClass, Metadata[inheritance])
- method_declaration      (inline class methods + out-of-line decls)
- field_declaration       (class data members)
- namespace_definition    (PackageName context for contained nodes)
- template_declaration    (Metadata[template_params])
- ThrowStmt               (Metadata[throw_expression])
- TryStmt                 (Metadata[catch_clauses] = []string)

Notable features:
- Pure virtual: pure_virtual_clause → Metadata[is_pure_virtual]
- Override: virtual_specifier "override" → Metadata[is_override]
- Virtual: "virtual" keyword child → Metadata[is_virtual]
- Destructors: ~ClassName parsed as method, Metadata[is_destructor]
- Multi-inheritance: Metadata[inheritance] = ["public Animal", "private Logger"]
- Anonymous namespaces: Name="" with PackageName empty
- Nested namespaces: PackageName builds outer::inner chain
- Access propagation: access_specifier siblings update class node's
  Metadata[current_access]; subsequent fields/methods read it for Modifier
- Out-of-line method definitions: qualified declarator name kept as-is
  (Foo::bar) — call-graph builder will link to the inline declaration

# parser.go (modified)

Two existing cases gained a C++ branch (function_definition, call_expression)
via tidy `switch {}` chains. Five new cases for C++-only node types
(class_specifier, namespace_definition, template_declaration,
throw_statement, try_statement, access_specifier). The struct_specifier,
enum_specifier, type_definition, declaration, preproc_include cases now
dispatch to the C-flavour or C++-flavour parse function based on file type.

field_declaration dispatch fixed: previously parseJavaVariableDeclaration
ran for every field_declaration regardless of language, producing
Java-tagged variable_declaration nodes inside C struct bodies. Now C
files skip the case (struct fields are extracted in parseCStructSpecifier
via clike), C++ files route to parseCppFieldDeclaration, Java files keep
their existing handling.

Java-only handlers gated by isJavaSourceFile: block, yield_statement,
if_statement, while_statement, do_statement, for_statement,
binary_expression, class_declaration, block_comment. These were running
on every file's AST and producing Java-tagged nodes for C/C++ code that
happens to share the same node type names. Each guard is a single-line
addition that fixes cross-language pollution without touching the Java
parser internals.

# parser_c.go (touched)

parseCLikeDeclaration now delegates destructor-shaped declarations
(`~ClassName();` inside a class body) to
emitCppMethodDeclarationFromDeclaration in parser_cpp.go when in class
context. Previously these were emitted as function_definition with
is_declaration=true; they should be method_declaration. The dispatch is
a single class-context check guarded by the existing isCpp flag.

childrenByFieldName renamed to childDeclarators (only ever called with
"declarator" — unparam linter caught the dead generality).

# Tests

parser_cpp_test.go::TestParseCppEndToEnd parses testdata/cpp/ as a real
C++ project via Initialize() and asserts every gap-analysis point from
the tech spec:

- class_declarations_with_inheritance (Dog : public Animal)
- namespace_propagates_to_classes (Dog.PackageName = "mylib")
- anonymous_namespace_has_no_name
- method_declarations_with_access_and_override (public/private + override)
- destructor_recognised_as_method (~Animal with is_destructor)
- class_field_declaration (Dog.age, public)
- template_parameters_recorded (typename T)
- throw_statement and try_statement_with_catch_types
- call_expressions_with_shapes (dot, qualified — arrow tested separately)
- scoped_enum_marked (enum class Color)
- typedef_recorded_with_cpp_language
- struct_with_cpp_language (Point)
- forward_declarations_in_header (buffer.hpp)
- regression_no_java_tagged_nodes_in_cpp_files

Plus targeted unit tests for the defensive paths (forward class
declaration, catch (...), nil template list, recordAccessSpecifier
outside class).

graph_test.go updated:
- TestInitializeWithNonJavaFiles: .cpp is now parsed (this PR's whole
  point), so the assertion changed from "1 node" to "Java class +
  C++ function both present"
- TestBuildGraphFromASTPython{FunctionDefinition,ClassDefinition}: the
  Java BlockStmt leak that artificially inflated the Python node counts
  is now fixed by the parser.go guards; expectedNodeCount values
  reduced to match the now-correct reality. The test still verifies
  function name, parameters, and isPython flag — what it actually
  intends to assert.

Co-Authored-By: Claude <noreply@anthropic.com>
---
 sast-engine/graph/graph_test.go              |  29 +-
 sast-engine/graph/parser.go                  | 120 ++-
 sast-engine/graph/parser_c.go                |  83 +-
 sast-engine/graph/parser_c_test.go           |   4 +-
 sast-engine/graph/parser_cpp.go              | 901 +++++++++++++++++++
 sast-engine/graph/parser_cpp_test.go         | 400 ++++++++
 sast-engine/{graph => }/testdata/c/buffer.h  |   0
 sast-engine/{graph => }/testdata/c/example.c |   0
 sast-engine/testdata/cpp/buffer.hpp          |  16 +
 sast-engine/testdata/cpp/example.cpp         |  67 ++
 10 files changed, 1557 insertions(+), 63 deletions(-)
 create mode 100644 sast-engine/graph/parser_cpp.go
 create mode 100644 sast-engine/graph/parser_cpp_test.go
 rename sast-engine/{graph => }/testdata/c/buffer.h (100%)
 rename sast-engine/{graph => }/testdata/c/example.c (100%)
 create mode 100644 sast-engine/testdata/cpp/buffer.hpp
 create mode 100644 sast-engine/testdata/cpp/example.cpp

diff --git a/sast-engine/graph/graph_test.go b/sast-engine/graph/graph_test.go
index 14d43faa..291302dc 100644
--- a/sast-engine/graph/graph_test.go
+++ b/sast-engine/graph/graph_test.go
@@ -589,15 +589,18 @@ func TestInitializeWithNonJavaFiles(t *testing.T) {
 		t.Fatal("Initialize returned nil graph")
 	}
 
-	expectedNodeCount := 1 // Only one Java file
-	if len(graph.Nodes) != expectedNodeCount {
-		t.Errorf("Expected %d node, but got %d", expectedNodeCount, len(graph.Nodes))
-	}
-
+	// .java and .cpp are both parsed; only the .txt is ignored.
+	// The Java file produces a class_declaration; the C++ file produces a
+	// function_definition for `main`.
+	byType := map[string]int{}
 	for _, node := range graph.Nodes {
-		if node.Type != "class_declaration" {
-			t.Errorf("Expected node type to be 'class', but got '%s'", node.Type)
-		}
+		byType[node.Type]++
+	}
+	if byType["class_declaration"] == 0 {
+		t.Errorf("expected at least one class_declaration node from File1.java; got %v", byType)
+	}
+	if byType["function_definition"] == 0 {
+		t.Errorf("expected at least one function_definition node from File3.cpp; got %v", byType)
 	}
 }
 
@@ -1043,7 +1046,7 @@ func TestBuildGraphFromASTPythonFunctionDefinition(t *testing.T) {
 			name: "Function with parameters",
 			sourceCode: `def add(x, y):
     return x + y`,
-			expectedNodeCount: 2, // function + return
+			expectedNodeCount: 1, // function (Python parsers do not yet emit a Node for return inside this fixture)
 			expectedName:      "add",
 			expectedParams:    []string{"x", "y"},
 		},
@@ -1051,7 +1054,7 @@ func TestBuildGraphFromASTPythonFunctionDefinition(t *testing.T) {
 			name: "Method with self parameter",
 			sourceCode: `def method(self, arg1, arg2):
     self.value = arg1`,
-			expectedNodeCount: 2, // function + assignment
+			expectedNodeCount: 1, // function only — assignment recursion below the function does not produce a graph.Node here
 			expectedName:      "method",
 			expectedParams:    []string{"self", "arg1", "arg2"},
 		},
@@ -1059,7 +1062,7 @@ func TestBuildGraphFromASTPythonFunctionDefinition(t *testing.T) {
 			name: "Function with default parameters",
 			sourceCode: `def func_with_defaults(x, y=10, z=20):
     return x + y + z`,
-			expectedNodeCount: 2, // function + return
+			expectedNodeCount: 1, // function only
 			expectedName:      "func_with_defaults",
 			expectedParams:    []string{"x", "y=10", "z=20"}, // Parser captures default values
 		},
@@ -1157,7 +1160,7 @@ func TestBuildGraphFromASTPythonClassDefinition(t *testing.T) {
 			sourceCode: `class MyClass:
     def my_method(self):
         return 42`,
-			expectedNodeCount: 3, // class + method + return
+			expectedNodeCount: 2, // class + method
 			expectedClassName: "MyClass",
 			expectedBases:     []string{},
 		},
@@ -1166,7 +1169,7 @@ func TestBuildGraphFromASTPythonClassDefinition(t *testing.T) {
 			sourceCode: `class Person:
     def __init__(self, name):
         self.name = name`,
-			expectedNodeCount: 3, // class + __init__ + assignment
+			expectedNodeCount: 2, // class + __init__
 			expectedClassName: "Person",
 			expectedBases:     []string{},
 		},
diff --git a/sast-engine/graph/parser.go b/sast-engine/graph/parser.go
index eac0bfbd..314479be 100644
--- a/sast-engine/graph/parser.go
+++ b/sast-engine/graph/parser.go
@@ -18,9 +18,12 @@ func buildGraphFromAST(node *sitter.Node, sourceCode []byte, graph *CodeGraph, c
 	// language. C/C++ branches come first because the dispatcher is the
 	// only place these node types are handled.
 	case "function_definition":
-		if isCFile {
+		switch {
+		case isCFile:
 			currentContext = parseCFunctionDefinition(node, sourceCode, graph, file)
-		} else if isPythonSourceFile {
+		case isCppFile:
+			currentContext = parseCppFunctionDefinition(node, sourceCode, graph, file, currentContext)
+		case isPythonSourceFile:
 			currentContext = parsePythonFunctionDefinition(node, sourceCode, graph, file, currentContext)
 		}
 
@@ -59,33 +62,50 @@ func buildGraphFromAST(node *sitter.Node, sourceCode []byte, graph *CodeGraph, c
 			parsePythonAssignment(node, sourceCode, graph, file, currentContext)
 		}
 
-	// Java-specific node types
+	// Java-specific node types. The C/C++ grammars emit several of these
+	// same node types (block, if/while/do/for, binary_expression) with
+	// different AST shapes, so the Java handlers must be gated by language —
+	// otherwise they pollute C/C++ graphs with Java-tagged nodes.
 	case "block":
-		parseBlockStatement(node, sourceCode, graph, file, isJavaSourceFile)
+		if isJavaSourceFile {
+			parseBlockStatement(node, sourceCode, graph, file, isJavaSourceFile)
+		}
 
 	case "yield_statement":
-		parseYieldStatement(node, sourceCode, graph, file, isJavaSourceFile)
+		if isJavaSourceFile {
+			parseYieldStatement(node, sourceCode, graph, file, isJavaSourceFile)
+		}
 
 	case "if_statement":
-		parseIfStatement(node, sourceCode, graph, file, isJavaSourceFile)
+		if isJavaSourceFile {
+			parseIfStatement(node, sourceCode, graph, file, isJavaSourceFile)
+		}
 		if isGoSourceFile {
 			parseGoIfStatement(node, sourceCode, graph, file)
 		}
 
 	case "while_statement":
-		parseWhileStatement(node, sourceCode, graph, file, isJavaSourceFile)
+		if isJavaSourceFile {
+			parseWhileStatement(node, sourceCode, graph, file, isJavaSourceFile)
+		}
 
 	case "do_statement":
-		parseDoStatement(node, sourceCode, graph, file, isJavaSourceFile)
+		if isJavaSourceFile {
+			parseDoStatement(node, sourceCode, graph, file, isJavaSourceFile)
+		}
 
 	case "for_statement":
-		parseForStatement(node, sourceCode, graph, file, isJavaSourceFile)
+		if isJavaSourceFile {
+			parseForStatement(node, sourceCode, graph, file, isJavaSourceFile)
+		}
 		if isGoSourceFile {
 			parseGoForStatement(node, sourceCode, graph, file)
 		}
 
 	case "binary_expression":
-		currentContext = parseJavaBinaryExpression(node, sourceCode, graph, file, isJavaSourceFile)
+		if isJavaSourceFile {
+			currentContext = parseJavaBinaryExpression(node, sourceCode, graph, file, isJavaSourceFile)
+		}
 
 	case "method_declaration":
 		if isJavaSourceFile {
@@ -98,14 +118,32 @@ func buildGraphFromAST(node *sitter.Node, sourceCode []byte, graph *CodeGraph, c
 		parseJavaMethodInvocation(node, sourceCode, graph, currentContext, file)
 
 	case "class_declaration":
-		parseJavaClassDeclaration(node, sourceCode, graph, file)
+		if isJavaSourceFile {
+			parseJavaClassDeclaration(node, sourceCode, graph, file)
+		}
 
 	case "block_comment":
-		parseJavaBlockComment(node, sourceCode, graph, file)
+		if isJavaSourceFile {
+			parseJavaBlockComment(node, sourceCode, graph, file)
+		}
 
-	case "local_variable_declaration", "field_declaration":
+	case "local_variable_declaration":
 		parseJavaVariableDeclaration(node, sourceCode, graph, file)
 
+	case "field_declaration":
+		// tree-sitter overloads field_declaration:
+		//   - Java: class fields (handled by parseJavaVariableDeclaration)
+		//   - C:    struct fields (handled by parseCStructSpecifier via clike;
+		//           the bare nodes here are siblings of an already-recorded
+		//           struct, so we skip them to avoid duplicate nodes)
+		//   - C++:  data members AND inline method declarations inside a
+		//           class body (handled by parseCppFieldDeclaration)
+		if isCppFile {
+			parseCppFieldDeclaration(node, sourceCode, graph, file, currentContext)
+		} else if !isCFile {
+			parseJavaVariableDeclaration(node, sourceCode, graph, file)
+		}
+
 	case "object_creation_expression":
 		parseJavaObjectCreation(node, sourceCode, graph, file)
 
@@ -121,28 +159,37 @@ func buildGraphFromAST(node *sitter.Node, sourceCode []byte, graph *CodeGraph, c
 		}
 
 	case "call_expression":
-		if isCFile {
+		switch {
+		case isCFile:
 			parseCCallExpression(node, sourceCode, graph, file, currentContext)
-		} else if isGoSourceFile {
+		case isCppFile:
+			parseCppCallExpression(node, sourceCode, graph, file, currentContext)
+		case isGoSourceFile:
 			parseGoCallExpression(node, sourceCode, graph, file, currentContext)
 		}
 
-	// C/C++ specific node types. struct_specifier appears in C only at the
-	// top level (C++ uses class_specifier for the equivalent construct);
-	// the remaining four are shared between C and C++.
+	// C and C++ shared node types — each language has its own parse
+	// function that sets the right Language tag and handles language-
+	// specific concerns (e.g. C++ struct inheritance via base_class_clause).
 	case "struct_specifier":
 		if isCFile {
 			parseCStructSpecifier(node, sourceCode, graph, file)
+		} else if isCppFile {
+			parseCppStructSpecifier(node, sourceCode, graph, file)
 		}
 
 	case "enum_specifier":
-		if isCFile || isCppFile {
+		if isCFile {
 			parseCEnumSpecifier(node, sourceCode, graph, file)
+		} else if isCppFile {
+			parseCppEnumSpecifier(node, sourceCode, graph, file)
 		}
 
 	case "type_definition":
-		if isCFile || isCppFile {
+		if isCFile {
 			parseCTypeDefinition(node, sourceCode, graph, file)
+		} else if isCppFile {
+			parseCppTypeDefinition(node, sourceCode, graph, file)
 		}
 
 	case "declaration":
@@ -155,6 +202,39 @@ func buildGraphFromAST(node *sitter.Node, sourceCode []byte, graph *CodeGraph, c
 			parseCLikeInclude(node, sourceCode, graph, file, isCppFile)
 		}
 
+	// C++-only node types. The dispatcher returns the new node from
+	// class_specifier and namespace_definition so the recursion picks up
+	// the surrounding scope as currentContext for member resolution.
+	case "class_specifier":
+		if isCppFile {
+			currentContext = parseCppClassSpecifier(node, sourceCode, graph, file, currentContext)
+		}
+
+	case "namespace_definition":
+		if isCppFile {
+			currentContext = parseCppNamespaceDefinition(node, sourceCode, graph, file, currentContext)
+		}
+
+	case "template_declaration":
+		if isCppFile {
+			parseCppTemplateDeclaration(node, sourceCode, graph, file)
+		}
+
+	case "throw_statement":
+		if isCppFile {
+			parseCppThrowStatement(node, sourceCode, graph, file, currentContext)
+		}
+
+	case "try_statement":
+		if isCppFile {
+			parseCppTryStatement(node, sourceCode, graph, file, currentContext)
+		}
+
+	case "access_specifier":
+		if isCppFile {
+			recordAccessSpecifier(node, sourceCode, currentContext)
+		}
+
 	case "short_var_declaration":
 		if isGoSourceFile {
 			parseGoShortVarDeclaration(node, sourceCode, graph, file)
diff --git a/sast-engine/graph/parser_c.go b/sast-engine/graph/parser_c.go
index e0ac2902..3e6e6dbe 100644
--- a/sast-engine/graph/parser_c.go
+++ b/sast-engine/graph/parser_c.go
@@ -7,17 +7,23 @@ import (
 	sitter "github.com/smacker/go-tree-sitter"
 )
 
-// parser_c.go converts tree-sitter C AST nodes into graph.Node objects. The
-// dispatcher in buildGraphFromAST (parser.go) selects the parse functions
-// declared here for files whose extension routes them to the tree-sitter C
-// grammar — every entry point sets Language="c" on the produced node.
+// parser_c.go converts tree-sitter C AST nodes into graph.Node objects.
+// The dispatcher in buildGraphFromAST (parser.go) selects the parse
+// functions declared here for files whose extension routes them to the
+// tree-sitter C grammar — every entry point sets Language="c" on the
+// produced node.
 //
-// All AST extraction (function metadata, type strings, parameter lists,
-// struct fields, call info) goes through the graph/clike helper package
-// so that parser_cpp.go (PR-04) can reuse the same primitives without
-// duplicating logic. Two helpers in this file — parseCLikeDeclaration and
-// parseCLikeInclude — are deliberately language-neutral and accept an
-// isCpp flag so the C++ parser can call them directly.
+// C++ has its own dispatcher (parser_cpp.go) — separate file, separate
+// constructs (classes, namespaces, templates, throw/try). The two parsers
+// share the AST extraction primitives in graph/clike (function metadata,
+// type strings, parameter lists, struct fields, call info), and a small
+// set of language-neutral helpers (childrenByFieldName, bareIdentifierName,
+// extractTaggedName, lineRange, newSourceLocation, scopeFromContext,
+// languageOfFile, normaliseIncludePath) that live at the bottom of this
+// file. Two parse functions — parseCLikeDeclaration and parseCLikeInclude —
+// are deliberately language-neutral and accept an isCpp flag because the
+// AST shape for variable declarations and #include directives is identical
+// between the two grammars.
 
 // Language tags used as Node.Language values for the C/C++ parsers.
 const (
@@ -25,7 +31,9 @@ const (
 	languageCpp = "cpp"
 )
 
-// Node.Type values produced by the C/C++ parsers.
+// Node.Type values produced by the C parser. C++-only types
+// (method_declaration, class_declaration, ThrowStmt, TryStmt, …) live in
+// parser_cpp.go.
 const (
 	nodeTypeFunctionDefinition = "function_definition"
 	nodeTypeStructDeclaration  = "struct_declaration"
@@ -36,9 +44,11 @@ const (
 	nodeTypeIncludeStatement   = "include_statement"
 )
 
-// Metadata keys used by the C/C++ parsers. Keeping these as constants makes
-// downstream consumers (rule writers, the call-graph builder) discoverable
-// and prevents key drift.
+// Metadata keys produced by the C parser. The keys covering call-shape
+// detection (is_method / is_arrow / is_qualified / receiver) are shared
+// with the C++ parser because clike.CallInfo classifies all four call
+// shapes regardless of language — C can also produce arrow-method calls
+// through function pointers.
 const (
 	metaIsDeclaration  = "is_declaration"
 	metaSystemInclude  = "system_include"
@@ -46,6 +56,10 @@ const (
 	metaEnumerators    = "enumerators"
 	metaUnderlyingType = "underlying_type"
 	metaIsAnonymous    = "is_anonymous"
+	metaIsMethod       = "is_method"
+	metaIsArrow        = "is_arrow"
+	metaIsQualified    = "is_qualified"
+	metaReceiver       = "receiver"
 )
 
 // =============================================================================
@@ -197,7 +211,7 @@ func parseCTypeDefinition(node *sitter.Node, sourceCode []byte, graph *CodeGraph
 		underlying = strings.TrimSpace(typeNode.Content(sourceCode))
 	}
 
-	for _, declarator := range childrenByFieldName(node, "declarator") {
+	for _, declarator := range childDeclarators(node) {
 		aliasName := bareIdentifierName(declarator, sourceCode)
 		if aliasName == "" {
 			aliasName = strings.TrimSpace(declarator.Content(sourceCode))
@@ -241,7 +255,19 @@ func parseCLikeDeclaration(node *sitter.Node, sourceCode []byte, graph *CodeGrap
 	// function_definition node so callers and call-graph builders find it
 	// alongside actual definitions; Metadata["is_declaration"] = true
 	// distinguishes the prototype from a body-bearing definition.
+	//
+	// In C++, the same shape is used by tree-sitter for destructors and
+	// inline method declarations that don't go through field_declaration
+	// (e.g. `~ClassName();` inside a class body). When we are in class
+	// context we route to the C++ helper which emits a method_declaration
+	// node instead — keeping rule writers free of dispatch concerns.
 	if isFunctionPrototype(node) {
+		if isCpp {
+			if classNode := classFromContext(currentContext); classNode != nil {
+				emitCppMethodDeclarationFromDeclaration(node, sourceCode, graph, file, classNode)
+				return
+			}
+		}
 		emitFunctionDeclaration(node, sourceCode, graph, file, isCpp)
 		return
 	}
@@ -251,7 +277,7 @@ func parseCLikeDeclaration(node *sitter.Node, sourceCode []byte, graph *CodeGrap
 	language := languageOfFile(isCpp)
 	lineNumber := node.StartPoint().Row + 1
 
-	for _, declarator := range childrenByFieldName(node, "declarator") {
+	for _, declarator := range childDeclarators(node) {
 		name, valueText := bareIdentifierAndInitialiser(declarator, sourceCode)
 		if name == "" {
 			continue
@@ -279,7 +305,7 @@ func parseCLikeDeclaration(node *sitter.Node, sourceCode []byte, graph *CodeGrap
 // any declarator being a function_declarator is enough to treat the whole
 // declaration as a prototype, which matches what real C codebases do.
 func isFunctionPrototype(node *sitter.Node) bool {
-	for _, declarator := range childrenByFieldName(node, "declarator") {
+	for _, declarator := range childDeclarators(node) {
 		for cur := declarator; cur != nil; cur = cur.ChildByFieldName("declarator") {
 			if cur.Type() == "function_declarator" {
 				return true
@@ -341,16 +367,16 @@ func parseCCallExpression(node *sitter.Node, sourceCode []byte, graph *CodeGraph
 
 	metadata := map[string]any{}
 	if info.IsMethod {
-		metadata["is_method"] = true
+		metadata[metaIsMethod] = true
 	}
 	if info.IsArrow {
-		metadata["is_arrow"] = true
+		metadata[metaIsArrow] = true
 	}
 	if info.IsQualified {
-		metadata["is_qualified"] = true
+		metadata[metaIsQualified] = true
 	}
 	if info.Receiver != "" {
-		metadata["receiver"] = info.Receiver
+		metadata[metaReceiver] = info.Receiver
 	}
 
 	callNode := &Node{
@@ -421,15 +447,16 @@ func collectStorageClassSpecifiers(node *sitter.Node, sourceCode []byte) []strin
 	return classes
 }
 
-// childrenByFieldName returns every direct child of node whose field name
-// matches name. tree-sitter exposes ChildByFieldName for the *first* match
-// only, but several C constructs (declaration with multiple init_declarators,
-// type_definition with multiple alias names) repeat the same field — this
-// helper iterates the full child list and yields all of them in order.
-func childrenByFieldName(node *sitter.Node, name string) []*sitter.Node {
+// childDeclarators returns every direct child of node whose field name is
+// "declarator". tree-sitter's stdlib ChildByFieldName returns the *first*
+// match only, but several C/C++ constructs (declaration with multiple
+// init_declarators, type_definition with multiple alias names) repeat the
+// same field — this helper iterates the full child list and yields all
+// of them in order.
+func childDeclarators(node *sitter.Node) []*sitter.Node {
 	var matches []*sitter.Node
 	for i := 0; i < int(node.ChildCount()); i++ {
-		if node.FieldNameForChild(i) == name {
+		if node.FieldNameForChild(i) == "declarator" {
 			if c := node.Child(i); c != nil {
 				matches = append(matches, c)
 			}
diff --git a/sast-engine/graph/parser_c_test.go b/sast-engine/graph/parser_c_test.go
index 5c7d915f..61070e13 100644
--- a/sast-engine/graph/parser_c_test.go
+++ b/sast-engine/graph/parser_c_test.go
@@ -51,7 +51,7 @@ func findFirstNodeOfType(node *sitter.Node, nodeType string) *sitter.Node {
 	return nil
 }
 
-// TestParseCEndToEnd parses testdata/c/ as a complete project via Initialize
+// TestParseCEndToEnd parses ../testdata/c/ as a complete project via Initialize
 // and asserts that every node type the C parser is responsible for —
 // function definitions, forward declarations, structs, enums, typedefs,
 // variable declarations, includes, and call expressions — produces graph
@@ -64,7 +64,7 @@ func findFirstNodeOfType(node *sitter.Node, nodeType string) *sitter.Node {
 // parser.go. If a future refactor moves parse functions into a subpackage
 // or changes the dispatch order, this test will catch behavioural drift.
 func TestParseCEndToEnd(t *testing.T) {
-	graph := Initialize("testdata/c", nil)
+	graph := Initialize("../testdata/c", nil)
 	if graph == nil {
 		t.Fatal("Initialize returned nil")
 	}
diff --git a/sast-engine/graph/parser_cpp.go b/sast-engine/graph/parser_cpp.go
new file mode 100644
index 00000000..c3ef59ec
--- /dev/null
+++ b/sast-engine/graph/parser_cpp.go
@@ -0,0 +1,901 @@
+package graph
+
+import (
+	"strings"
+
+	"github.com/shivasurya/code-pathfinder/sast-engine/graph/clike"
+	sitter "github.com/smacker/go-tree-sitter"
+)
+
+// parser_cpp.go converts tree-sitter C++ AST nodes into graph.Node objects.
+// Every entry point sets Language="cpp" on the produced node.
+//
+// C and C++ live in separate files (parser_c.go vs parser_cpp.go) so that
+// language-specific concerns — class hierarchies, namespaces, templates,
+// access specifiers, exception flow — stay isolated. The two parsers share:
+//
+//   - graph/clike  — language-neutral AST extraction (function info, type
+//     strings, parameter lists, struct fields, call info)
+//   - language-neutral helpers in parser_c.go (childrenByFieldName,
+//     bareIdentifierName, extractTaggedName, lineRange, newSourceLocation,
+//     scopeFromContext, languageOfFile, normaliseIncludePath,
+//     collectStorageClassSpecifiers)
+//   - parseCLikeDeclaration / parseCLikeInclude in parser_c.go for the two
+//     constructs whose AST shape is identical between the two grammars
+//     (variable declarations and #include directives)
+//
+// Within this file, function-definition / class / field / call dispatchers
+// rely on currentContext to decide whether a node represents a method
+// inside a class, a function inside a namespace, or a free function — the
+// dispatcher in parser.go threads currentContext through buildGraphFromAST
+// recursion so each handler has the surrounding scope available.
+
+// Node.Type values produced by the C++ parser. Keep these next to the
+// parser that emits them so adding new construct support touches one file
+// only.
+const (
+	nodeTypeMethodDeclaration = "method_declaration"
+	nodeTypeClassDeclaration  = "class_declaration"
+	nodeTypeFieldDecl         = "field_declaration"
+	nodeTypeThrowStmt         = "ThrowStmt"
+	nodeTypeTryStmt           = "TryStmt"
+)
+
+// Metadata keys produced by the C++ parser. The shared keys (is_method,
+// is_arrow, is_qualified, receiver) live in parser_c.go because clike
+// classifies all four call shapes regardless of language.
+const (
+	metaTemplateParams = "template_params"
+	metaCurrentAccess  = "current_access"
+	metaNamespace      = "namespace"
+	metaThrowExpr      = "throw_expression"
+	metaCatchClauses   = "catch_clauses"
+	metaIsDestructor   = "is_destructor"
+	metaIsVirtual      = "is_virtual"
+	metaIsPureVirtual  = "is_pure_virtual"
+	metaIsOverride     = "is_override"
+	metaInheritance    = "inheritance" // []string{"public Animal", "private Logger"}
+)
+
+// =============================================================================
+// Class declarations
+// =============================================================================
+
+// parseCppClassSpecifier records a class_specifier and returns the class
+// node so the dispatcher in parser.go can use it as currentContext for the
+// recursion into the class body. Subsequent access_specifier nodes seen
+// during recursion update Metadata[metaCurrentAccess] on the same map; the
+// field/method handlers read that value to populate Modifier.
+//
+// Inheritance is captured from base_class_clause: SuperClass holds the
+// first base class's bare name (matching the existing graph.Node convention),
+// and Metadata["inheritance"] holds the full list with access specifiers
+// (e.g. ["public Animal", "private Logger"]) so multi-inheritance rules can
+// see every base.
+//
+// Anonymous structs/classes used as inline type expressions (e.g. inside
+// `typedef struct { ... } X;`) carry empty Name and
+// Metadata["is_anonymous"] = true.
+func parseCppClassSpecifier(node *sitter.Node, sourceCode []byte, g *CodeGraph, file string, currentContext *Node) *Node {
+	body := node.ChildByFieldName("body")
+	if body == nil {
+		// `class Foo` used as a forward declaration or type reference. The
+		// referencing site (declaration / parameter) carries the type
+		// information; we do not record an empty class node.
+		return nil
+	}
+
+	name, isAnonymous := extractTaggedName(node, sourceCode)
+	superClass, inheritance := extractBaseClasses(node, sourceCode)
+
+	metadata := map[string]any{}
+	if isAnonymous {
+		metadata[metaIsAnonymous] = true
+	}
+	if len(inheritance) > 0 {
+		metadata[metaInheritance] = inheritance
+	}
+
+	classNode := &Node{
+		ID:             GenerateSha256("class:" + name + "@" + file + "#" + lineRange(node)),
+		Type:           nodeTypeClassDeclaration,
+		Name:           name,
+		LineNumber:     node.StartPoint().Row + 1,
+		SuperClass:     superClass,
+		File:           file,
+		Language:       languageCpp,
+		PackageName:    packageNameFromContext(currentContext),
+		SourceLocation: newSourceLocation(file, node),
+		Metadata:       metadata,
+	}
+	g.AddNode(classNode)
+	return classNode
+}
+
+// extractBaseClasses parses a class_specifier's base_class_clause and
+// returns (firstBareName, inheritanceEntries). The first bare name is the
+// most common access pattern (single-inheritance), exposed via
+// graph.Node.SuperClass; inheritanceEntries preserves access specifiers
+// and ordering for the rare but important multi-inheritance case.
+func extractBaseClasses(class *sitter.Node, sourceCode []byte) (string, []string) {
+	var clause *sitter.Node
+	for i := 0; i < int(class.NamedChildCount()); i++ {
+		c := class.NamedChild(i)
+		if c != nil && c.Type() == "base_class_clause" {
+			clause = c
+			break
+		}
+	}
+	if clause == nil {
+		return "", nil
+	}
+
+	var firstBare string
+	var entries []string
+	currentAccess := ""
+	for i := 0; i < int(clause.NamedChildCount()); i++ {
+		child := clause.NamedChild(i)
+		if child == nil {
+			continue
+		}
+		switch child.Type() {
+		case "access_specifier":
+			currentAccess = strings.TrimSpace(child.Content(sourceCode))
+		case "type_identifier", "qualified_identifier", "template_type":
+			bare := strings.TrimSpace(child.Content(sourceCode))
+			if firstBare == "" {
+				firstBare = bare
+			}
+			if currentAccess != "" {
+				entries = append(entries, currentAccess+" "+bare)
+				currentAccess = ""
+			} else {
+				entries = append(entries, bare)
+			}
+		}
+	}
+	return firstBare, entries
+}
+
+// =============================================================================
+// Namespaces
+// =============================================================================
+
+// parseCppNamespaceDefinition records a namespace as a context node. The
+// returned node is consumed by buildGraphFromAST as currentContext so every
+// declaration inside the namespace body inherits PackageName. Anonymous
+// namespaces (no name child) emit a node with Name="" and PackageName="" —
+// rule writers can still inspect Type==namespace_definition while leaving
+// FQNs unqualified.
+//
+// Inline namespaces (`inline namespace x { ... }`) parse as namespace_definition
+// in tree-sitter; for Phase 1 we treat them identically to regular namespaces
+// since their visibility-promotion semantics do not change FQN construction.
+func parseCppNamespaceDefinition(node *sitter.Node, sourceCode []byte, g *CodeGraph, file string, currentContext *Node) *Node {
+	name := ""
+	if nameNode := node.ChildByFieldName("name"); nameNode != nil {
+		name = nameNode.Content(sourceCode)
+	}
+
+	// Compose nested namespace name for the contextual PackageName: an
+	// inner namespace inherits the outer namespace's prefix.
+	prefix := packageNameFromContext(currentContext)
+	combined := name
+	if prefix != "" {
+		if name == "" {
+			combined = prefix
+		} else {
+			combined = prefix + "::" + name
+		}
+	}
+
+	nsNode := &Node{
+		ID:             GenerateSha256("namespace:" + combined + "@" + file + "#" + lineRange(node)),
+		Type:           "namespace_definition",
+		Name:           name,
+		LineNumber:     node.StartPoint().Row + 1,
+		PackageName:    combined,
+		File:           file,
+		Language:       languageCpp,
+		SourceLocation: newSourceLocation(file, node),
+		Metadata:       map[string]any{metaNamespace: combined},
+	}
+	g.AddNode(nsNode)
+	return nsNode
+}
+
+// =============================================================================
+// Function definitions and method declarations
+// =============================================================================
+
+// parseCppFunctionDefinition records a function_definition. When
+// currentContext is a class, the produced node is a method_declaration;
+// otherwise it is a function_definition. Pure virtual methods (Animal::speak
+// = 0) carry Metadata[is_pure_virtual]=true; virtual methods carry
+// is_virtual; override-marked methods carry is_override.
+//
+// The function is also reached for out-of-line method definitions
+// (`void Foo::bar() { ... }` at translation-unit scope). In that case the
+// declarator chain produces a qualified name like "Foo::bar" via
+// clike.ExtractFunctionInfo; we keep the qualified form in Name and emit
+// function_definition (not method_declaration) because the surrounding
+// context is not a class node — the call-graph builder in a later PR can
+// link these to their class declarations by FQN.
+func parseCppFunctionDefinition(node *sitter.Node, sourceCode []byte, g *CodeGraph, file string, currentContext *Node) *Node {
+	info := clike.ExtractFunctionInfo(node, sourceCode)
+	if info == nil {
+		return nil
+	}
+
+	storageClasses := collectStorageClassSpecifiers(node, sourceCode)
+	specifiers := collectVirtualSpecifiers(node, sourceCode)
+	hasPureVirtual := node.ChildByFieldName("default_value") != nil || hasChildOfType(node, "pure_virtual_clause")
+
+	insideClass := classFromContext(currentContext)
+	nodeType := nodeTypeFunctionDefinition
+	modifier := strings.Join(storageClasses, " ")
+	packageName := packageNameFromContext(currentContext)
+
+	if insideClass != nil {
+		nodeType = nodeTypeMethodDeclaration
+		access := classAccessFromContext(insideClass)
+		modifier = combineModifiers(access, storageClasses)
+	}
+
+	metadata := map[string]any{}
+	if info.IsDeclaration {
+		metadata[metaIsDeclaration] = true
+	}
+	if len(storageClasses) > 0 {
+		metadata[metaStorageClasses] = storageClasses
+	}
+	if specifiers["virtual"] {
+		metadata[metaIsVirtual] = true
+	}
+	if specifiers["override"] {
+		metadata[metaIsOverride] = true
+	}
+	if hasPureVirtual {
+		metadata[metaIsPureVirtual] = true
+		// A pure virtual still has no body — flag it as a declaration too
+		// so callers can treat the two forms uniformly.
+		metadata[metaIsDeclaration] = true
+	}
+	if isDestructorName(info.Name) {
+		metadata[metaIsDestructor] = true
+	}
+
+	functionNode := &Node{
+		ID:                   GenerateMethodID("function:"+info.Name, info.ParamTypes, file, info.LineNumber),
+		Type:                 nodeType,
+		Name:                 info.Name,
+		LineNumber:           info.LineNumber,
+		ReturnType:           info.ReturnType,
+		MethodArgumentsType:  info.ParamTypes,
+		MethodArgumentsValue: info.ParamNames,
+		Modifier:             modifier,
+		PackageName:          packageName,
+		File:                 file,
+		Language:             languageCpp,
+		SourceLocation:       newSourceLocation(file, node),
+		Metadata:             metadata,
+	}
+	g.AddNode(functionNode)
+	return functionNode
+}
+
+// =============================================================================
+// Field declarations (class data members and inline method declarations)
+// =============================================================================
+
+// parseCppFieldDeclaration handles `field_declaration` nodes inside a class
+// body. tree-sitter overloads field_declaration for two distinct C++
+// constructs:
+//
+//   - Data members:    `int x;`           → declarator is field_identifier
+//   - Method decls:    `void bar();`      → declarator is function_declarator
+//
+// We dispatch on declarator type so each construct produces the right
+// graph.Node (field_declaration vs method_declaration).
+//
+// Outside a class body, field_declaration is not expected for C/C++; the
+// dispatcher in parser.go guards against it.
+func parseCppFieldDeclaration(node *sitter.Node, sourceCode []byte, g *CodeGraph, file string, currentContext *Node) {
+	insideClass := classFromContext(currentContext)
+	declarator := node.ChildByFieldName("declarator")
+	typeNode := node.ChildByFieldName("type")
+
+	if isFunctionDeclarator(declarator) {
+		emitMethodDeclarationFromField(node, sourceCode, g, file, insideClass)
+		return
+	}
+
+	name := bareIdentifierName(declarator, sourceCode)
+	if name == "" {
+		return
+	}
+
+	access := classAccessFromContext(insideClass)
+	dataType := clike.ExtractTypeString(typeNode, declarator, sourceCode)
+
+	metadata := map[string]any{}
+	if access != "" {
+		metadata[metaCurrentAccess] = access
+	}
+
+	g.AddNode(&Node{
+		ID:             GenerateSha256("field:" + scopedName(insideClass, name) + "@" + file + "#" + lineRange(node)),
+		Type:           nodeTypeFieldDecl,
+		Name:           name,
+		DataType:       dataType,
+		Modifier:       access,
+		PackageName:    packageNameFromContext(insideClass),
+		LineNumber:     node.StartPoint().Row + 1,
+		File:           file,
+		Language:       languageCpp,
+		SourceLocation: newSourceLocation(file, node),
+		Metadata:       metadata,
+	})
+}
+
+// emitMethodDeclarationFromField records an inline method declaration that
+// tree-sitter parses as field_declaration (e.g. `void speak() override;`
+// inside a class body, no body of its own).
+func emitMethodDeclarationFromField(node *sitter.Node, sourceCode []byte, g *CodeGraph, file string, insideClass *Node) {
+	info := clike.ExtractFunctionInfo(node, sourceCode)
+	if info == nil || info.Name == "" {
+		return
+	}
+
+	access := classAccessFromContext(insideClass)
+	specifiers := collectVirtualSpecifiers(node, sourceCode)
+	hasPureVirtual := hasChildOfType(node, "pure_virtual_clause")
+
+	metadata := map[string]any{
+		metaIsDeclaration: true,
+	}
+	if specifiers["virtual"] {
+		metadata[metaIsVirtual] = true
+	}
+	if specifiers["override"] {
+		metadata[metaIsOverride] = true
+	}
+	if hasPureVirtual {
+		metadata[metaIsPureVirtual] = true
+	}
+	if isDestructorName(info.Name) {
+		metadata[metaIsDestructor] = true
+	}
+
+	g.AddNode(&Node{
+		ID:                   GenerateMethodID("function:"+info.Name, info.ParamTypes, file, info.LineNumber),
+		Type:                 nodeTypeMethodDeclaration,
+		Name:                 info.Name,
+		LineNumber:           info.LineNumber,
+		ReturnType:           info.ReturnType,
+		MethodArgumentsType:  info.ParamTypes,
+		MethodArgumentsValue: info.ParamNames,
+		Modifier:             access,
+		PackageName:          packageNameFromContext(insideClass),
+		File:                 file,
+		Language:             languageCpp,
+		SourceLocation:       newSourceLocation(file, node),
+		Metadata:             metadata,
+	})
+}
+
+// emitCppMethodDeclarationFromDeclaration records a method declaration that
+// reaches us as a top-level `declaration` node inside a class body — this
+// is how tree-sitter parses destructors (`~ClassName();`). Called from
+// parseCLikeDeclaration in parser_c.go when isCpp && currentContext is a
+// class.
+func emitCppMethodDeclarationFromDeclaration(node *sitter.Node, sourceCode []byte, g *CodeGraph, file string, insideClass *Node) {
+	info := clike.ExtractFunctionInfo(node, sourceCode)
+	if info == nil || info.Name == "" {
+		return
+	}
+
+	access := classAccessFromContext(insideClass)
+	metadata := map[string]any{metaIsDeclaration: true}
+	if isDestructorName(info.Name) {
+		metadata[metaIsDestructor] = true
+	}
+
+	g.AddNode(&Node{
+		ID:                   GenerateMethodID("function:"+info.Name, info.ParamTypes, file, info.LineNumber),
+		Type:                 nodeTypeMethodDeclaration,
+		Name:                 info.Name,
+		LineNumber:           info.LineNumber,
+		ReturnType:           info.ReturnType,
+		MethodArgumentsType:  info.ParamTypes,
+		MethodArgumentsValue: info.ParamNames,
+		Modifier:             access,
+		PackageName:          packageNameFromContext(insideClass),
+		File:                 file,
+		Language:             languageCpp,
+		SourceLocation:       newSourceLocation(file, node),
+		Metadata:             metadata,
+	})
+}
+
+// =============================================================================
+// Templates
+// =============================================================================
+
+// parseCppTemplateDeclaration records the template parameter list. The
+// inner construct (function_definition, class_specifier, etc.) is processed
+// during normal recursion; we just attach the template parameters to the
+// inner node by walking forward at insertion time.
+//
+// The metadata is stored on the outer template_declaration node so rule
+// writers can match on "templated function" without looking at the inner
+// shape — that match is then refined by joining on the inner node via
+// SourceLocation.
+func parseCppTemplateDeclaration(node *sitter.Node, sourceCode []byte, g *CodeGraph, file string) {
+	params := extractTemplateParameters(node.ChildByFieldName("parameters"), sourceCode)
+
+	g.AddNode(&Node{
+		ID:             GenerateSha256("template:" + strings.Join(params, ",") + "@" + file + "#" + lineRange(node)),
+		Type:           "template_declaration",
+		Name:           strings.Join(params, ","),
+		LineNumber:     node.StartPoint().Row + 1,
+		File:           file,
+		Language:       languageCpp,
+		SourceLocation: newSourceLocation(file, node),
+		Metadata:       map[string]any{metaTemplateParams: params},
+	})
+}
+
+// extractTemplateParameters reads a template_parameter_list and returns
+// each declared parameter in source order.
+//
+// Examples:
+//
+//	<typename T>             → ["T"]
+//	<typename T, int N>      → ["T", "N"]
+//	<class K, class V = int> → ["K", "V"]
+func extractTemplateParameters(list *sitter.Node, sourceCode []byte) []string {
+	if list == nil {
+		return nil
+	}
+	var params []string
+	for i := 0; i < int(list.NamedChildCount()); i++ {
+		child := list.NamedChild(i)
+		if child == nil {
+			continue
+		}
+		switch child.Type() {
+		case "type_parameter_declaration", "parameter_declaration", "optional_type_parameter_declaration":
+			// The parameter name is the last identifier / type_identifier child.
+			name := lastNamedIdentifier(child, sourceCode)
+			if name != "" {
+				params = append(params, name)
+			}
+		}
+	}
+	return params
+}
+
+// =============================================================================
+// Exception flow: throw and try/catch
+// =============================================================================
+
+// parseCppThrowStatement records a `throw expr;` statement. The expression
+// text is captured in Metadata["throw_expression"] so flow-analysis rules
+// can match on what was thrown without re-parsing the AST.
+//
+// `throw;` (re-throw) parses the same node type with no expression child;
+// Metadata["throw_expression"] is empty in that case.
+func parseCppThrowStatement(node *sitter.Node, sourceCode []byte, g *CodeGraph, file string, currentContext *Node) {
+	expr := ""
+	for i := 0; i < int(node.NamedChildCount()); i++ {
+		child := node.NamedChild(i)
+		if child != nil {
+			expr = strings.TrimSpace(child.Content(sourceCode))
+			break
+		}
+	}
+
+	throwNode := &Node{
+		ID:             GenerateSha256("throw:" + expr + "@" + file + "#" + lineRange(node)),
+		Type:           nodeTypeThrowStmt,
+		Name:           expr,
+		LineNumber:     node.StartPoint().Row + 1,
+		File:           file,
+		Language:       languageCpp,
+		SourceLocation: newSourceLocation(file, node),
+		Metadata:       map[string]any{metaThrowExpr: expr},
+	}
+	g.AddNode(throwNode)
+	if currentContext != nil {
+		g.AddEdge(currentContext, throwNode)
+	}
+}
+
+// parseCppTryStatement records a `try { ... } catch (...) { ... }` block.
+// Each catch clause's parameter type goes into Metadata["catch_clauses"]
+// as a []string so rule writers can match handlers by exception type.
+func parseCppTryStatement(node *sitter.Node, sourceCode []byte, g *CodeGraph, file string, currentContext *Node) {
+	catches := extractCatchExceptionTypes(node, sourceCode)
+
+	tryNode := &Node{
+		ID:             GenerateSha256("try@" + file + "#" + lineRange(node)),
+		Type:           nodeTypeTryStmt,
+		LineNumber:     node.StartPoint().Row + 1,
+		File:           file,
+		Language:       languageCpp,
+		SourceLocation: newSourceLocation(file, node),
+		Metadata:       map[string]any{metaCatchClauses: catches},
+	}
+	g.AddNode(tryNode)
+	if currentContext != nil {
+		g.AddEdge(currentContext, tryNode)
+	}
+}
+
+// extractCatchExceptionTypes walks every catch_clause child of a
+// try_statement and returns the type string of the caught exception in
+// each handler. `catch (...)` (catch-all) emits "..." as the entry.
+func extractCatchExceptionTypes(try *sitter.Node, sourceCode []byte) []string {
+	var types []string
+	for i := 0; i < int(try.NamedChildCount()); i++ {
+		child := try.NamedChild(i)
+		if child == nil || child.Type() != "catch_clause" {
+			continue
+		}
+		params := child.ChildByFieldName("parameters")
+		if params == nil {
+			types = append(types, "...")
+			continue
+		}
+		// catch (...) — single anonymous param.
+		paramText := strings.TrimSpace(params.Content(sourceCode))
+		if paramText == "(...)" {
+			types = append(types, "...")
+			continue
+		}
+		// Single parameter — extract its type via clike.
+		var first *sitter.Node
+		for j := 0; j < int(params.NamedChildCount()); j++ {
+			c := params.NamedChild(j)
+			if c != nil && (c.Type() == "parameter_declaration" || c.Type() == "optional_parameter_declaration") {
+				first = c
+				break
+			}
+		}
+		if first == nil {
+			types = append(types, "...")
+			continue
+		}
+		typeNode := first.ChildByFieldName("type")
+		declarator := first.ChildByFieldName("declarator")
+		types = append(types, clike.ExtractTypeString(typeNode, declarator, sourceCode))
+	}
+	return types
+}
+
+// =============================================================================
+// Calls, structs, enums, typedefs (C++ flavour)
+// =============================================================================
+
+// parseCppCallExpression records a call_expression with Language="cpp" and
+// the call-shape metadata produced by clike.ExtractCallInfo. Every call
+// shape (free function, dot method, arrow method, qualified) is handled by
+// the same code path because the classification is already done by clike.
+func parseCppCallExpression(node *sitter.Node, sourceCode []byte, g *CodeGraph, file string, currentContext *Node) {
+	info := clike.ExtractCallInfo(node, sourceCode)
+	if info == nil {
+		return
+	}
+
+	metadata := map[string]any{}
+	if info.IsMethod {
+		metadata[metaIsMethod] = true
+	}
+	if info.IsArrow {
+		metadata[metaIsArrow] = true
+	}
+	if info.IsQualified {
+		metadata[metaIsQualified] = true
+	}
+	if info.Receiver != "" {
+		metadata[metaReceiver] = info.Receiver
+	}
+
+	callNode := &Node{
+		ID:                   GenerateSha256("call:" + info.Target + "@" + file + "#" + lineRange(node)),
+		Type:                 nodeTypeCallExpression,
+		Name:                 info.Target,
+		MethodArgumentsValue: info.Args,
+		LineNumber:           node.StartPoint().Row + 1,
+		File:                 file,
+		Language:             languageCpp,
+		SourceLocation:       newSourceLocation(file, node),
+		Metadata:             metadata,
+	}
+	g.AddNode(callNode)
+	if currentContext != nil {
+		g.AddEdge(currentContext, callNode)
+	}
+}
+
+// parseCppStructSpecifier records a C++ struct (semantically a class with
+// public default visibility). Inheritance via base_class_clause is captured
+// the same way as parseCppClassSpecifier; the node Type stays
+// "struct_declaration" so rules can target structs vs classes when desired.
+func parseCppStructSpecifier(node *sitter.Node, sourceCode []byte, g *CodeGraph, file string) {
+	body := node.ChildByFieldName("body")
+	if body == nil {
+		// `struct S` used as a type reference. Skip.
+		return
+	}
+
+	name, isAnonymous := extractTaggedName(node, sourceCode)
+	fields := clike.ExtractStructFields(body, sourceCode)
+	fieldStrings := make([]string, 0, len(fields))
+	for _, f := range fields {
+		if f.Name == "" {
+			fieldStrings = append(fieldStrings, f.TypeStr)
+		} else {
+			fieldStrings = append(fieldStrings, f.Name+": "+f.TypeStr)
+		}
+	}
+
+	superClass, inheritance := extractBaseClasses(node, sourceCode)
+
+	metadata := map[string]any{}
+	if isAnonymous {
+		metadata[metaIsAnonymous] = true
+	}
+	if len(inheritance) > 0 {
+		metadata[metaInheritance] = inheritance
+	}
+
+	g.AddNode(&Node{
+		ID:                  GenerateSha256("struct:" + name + "@" + file + "#" + lineRange(node)),
+		Type:                nodeTypeStructDeclaration,
+		Name:                name,
+		LineNumber:          node.StartPoint().Row + 1,
+		MethodArgumentsType: fieldStrings,
+		SuperClass:          superClass,
+		File:                file,
+		Language:            languageCpp,
+		SourceLocation:      newSourceLocation(file, node),
+		Metadata:            metadata,
+	})
+}
+
+// parseCppEnumSpecifier records a C++ enum. C++ adds `enum class` (scoped
+// enums); when present, Metadata["is_scoped"] = true. Otherwise the
+// behaviour matches parseCEnumSpecifier — we don't share the function
+// because the AST shape differs (C++ has an additional `enum class` keyword
+// node).
+func parseCppEnumSpecifier(node *sitter.Node, sourceCode []byte, g *CodeGraph, file string) {
+	body := node.ChildByFieldName("body")
+	if body == nil {
+		return
+	}
+
+	name, isAnonymous := extractTaggedName(node, sourceCode)
+	enumerators := extractEnumerators(body, sourceCode)
+	isScoped := hasChildOfType(node, "class") || hasChildOfType(node, "struct")
+
+	metadata := map[string]any{
+		metaEnumerators: enumerators,
+	}
+	if isAnonymous {
+		metadata[metaIsAnonymous] = true
+	}
+	if isScoped {
+		metadata["is_scoped"] = true
+	}
+
+	g.AddNode(&Node{
+		ID:             GenerateSha256("enum:" + name + "@" + file + "#" + lineRange(node)),
+		Type:           nodeTypeEnumDeclaration,
+		Name:           name,
+		LineNumber:     node.StartPoint().Row + 1,
+		File:           file,
+		Language:       languageCpp,
+		SourceLocation: newSourceLocation(file, node),
+		Metadata:       metadata,
+	})
+}
+
+// parseCppTypeDefinition records a C++ typedef. C++ also has `using` alias
+// declarations that parse as `alias_declaration` (handled in a future PR);
+// `typedef` itself parses identically to C, so we simply re-tag the
+// produced node with Language="cpp".
+func parseCppTypeDefinition(node *sitter.Node, sourceCode []byte, g *CodeGraph, file string) {
+	typeNode := node.ChildByFieldName("type")
+	underlying := ""
+	if typeNode != nil {
+		underlying = strings.TrimSpace(typeNode.Content(sourceCode))
+	}
+	for _, declarator := range childDeclarators(node) {
+		aliasName := bareIdentifierName(declarator, sourceCode)
+		if aliasName == "" {
+			aliasName = strings.TrimSpace(declarator.Content(sourceCode))
+		}
+		g.AddNode(&Node{
+			ID:             GenerateSha256("typedef:" + aliasName + "@" + file + "#" + lineRange(node)),
+			Type:           nodeTypeTypeDefinition,
+			Name:           aliasName,
+			DataType:       underlying,
+			LineNumber:     node.StartPoint().Row + 1,
+			File:           file,
+			Language:       languageCpp,
+			SourceLocation: newSourceLocation(file, node),
+			Metadata:       map[string]any{metaUnderlyingType: underlying},
+		})
+	}
+}
+
+// =============================================================================
+// Access specifier propagation
+// =============================================================================
+
+// recordAccessSpecifier updates the access-tracking state on the enclosing
+// class node. tree-sitter emits access_specifier as a sibling preceding the
+// fields/methods it governs; the dispatcher in parser.go calls this when
+// it sees one, so subsequent parseCppFieldDeclaration / parseCppFunctionDefinition
+// calls (which run on the same currentContext map) read the updated value.
+func recordAccessSpecifier(node *sitter.Node, sourceCode []byte, currentContext *Node) {
+	classNode := classFromContext(currentContext)
+	if classNode == nil {
+		return
+	}
+	access := strings.TrimSpace(node.Content(sourceCode))
+	if classNode.Metadata == nil {
+		classNode.Metadata = map[string]any{}
+	}
+	classNode.Metadata[metaCurrentAccess] = access
+}
+
+// =============================================================================
+// Internal helpers
+// =============================================================================
+
+// classFromContext returns currentContext when it is a class_declaration
+// node, otherwise nil. Used by method and field handlers to detect class
+// membership.
+func classFromContext(currentContext *Node) *Node {
+	if currentContext != nil && currentContext.Type == nodeTypeClassDeclaration {
+		return currentContext
+	}
+	return nil
+}
+
+// classAccessFromContext returns the access specifier currently in effect
+// for the class node (the value most recently set by recordAccessSpecifier).
+// Returns "" when no access specifier has been seen yet — tree-sitter
+// preserves source order, so the first members of a class declared with no
+// preceding access_specifier inherit the class's default ("private" for
+// `class`, "public" for `struct`).
+func classAccessFromContext(classNode *Node) string {
+	if classNode == nil || classNode.Metadata == nil {
+		return ""
+	}
+	if v, ok := classNode.Metadata[metaCurrentAccess].(string); ok {
+		return v
+	}
+	return ""
+}
+
+// packageNameFromContext walks currentContext to find the enclosing
+// namespace's PackageName. Both namespace and class context nodes carry
+// PackageName; when neither is present, returns "".
+func packageNameFromContext(currentContext *Node) string {
+	if currentContext == nil {
+		return ""
+	}
+	return currentContext.PackageName
+}
+
+// scopedName joins a class name with a member name using "::" separator.
+// Used to produce stable IDs that distinguish identical member names
+// across classes in the same translation unit.
+func scopedName(classNode *Node, memberName string) string {
+	if classNode == nil || classNode.Name == "" {
+		return memberName
+	}
+	return classNode.Name + "::" + memberName
+}
+
+// combineModifiers joins a single access modifier ("public"/"private"/...)
+// with a list of storage class specifiers ("static", "inline", ...) into
+// the single Modifier string used by graph.Node. Empty inputs are skipped.
+func combineModifiers(access string, storageClasses []string) string {
+	parts := []string{}
+	if access != "" {
+		parts = append(parts, access)
+	}
+	parts = append(parts, storageClasses...)
+	return strings.Join(parts, " ")
+}
+
+// collectVirtualSpecifiers walks the children of a function_definition or
+// field_declaration looking for the specifier keywords that affect method
+// dispatch: "virtual" appears as a keyword child, "override"/"final" as
+// virtual_specifier nodes. Returns a presence-set keyed by the keyword.
+func collectVirtualSpecifiers(node *sitter.Node, sourceCode []byte) map[string]bool {
+	out := map[string]bool{}
+	for i := 0; i < int(node.ChildCount()); i++ {
+		c := node.Child(i)
+		if c == nil {
+			continue
+		}
+		if c.Type() == "virtual" {
+			out["virtual"] = true
+			continue
+		}
+		if c.Type() == "virtual_specifier" {
+			text := strings.TrimSpace(c.Content(sourceCode))
+			out[text] = true
+		}
+	}
+	// function_declarator inside the field/function may also carry the
+	// virtual_specifier (e.g. override appears after the parameter list).
+	if decl := node.ChildByFieldName("declarator"); decl != nil {
+		for i := 0; i < int(decl.ChildCount()); i++ {
+			c := decl.Child(i)
+			if c != nil && c.Type() == "virtual_specifier" {
+				text := strings.TrimSpace(c.Content(sourceCode))
+				out[text] = true
+			}
+		}
+	}
+	return out
+}
+
+// hasChildOfType reports whether any direct child of node has the given type.
+func hasChildOfType(node *sitter.Node, nodeType string) bool {
+	for i := 0; i < int(node.ChildCount()); i++ {
+		c := node.Child(i)
+		if c != nil && c.Type() == nodeType {
+			return true
+		}
+	}
+	return false
+}
+
+// isFunctionDeclarator walks past pointer/reference/array wrappers to
+// determine whether a declarator chain bottoms out at a function_declarator.
+func isFunctionDeclarator(node *sitter.Node) bool {
+	for cur := node; cur != nil; {
+		switch cur.Type() {
+		case "function_declarator":
+			return true
+		case "pointer_declarator", "reference_declarator", "array_declarator":
+			cur = cur.ChildByFieldName("declarator")
+			continue
+		}
+		return false
+	}
+	return false
+}
+
+// isDestructorName reports whether name is a destructor name (`~ClassName`).
+// Destructors are stored as Name="~ClassName" by clike so detection is a
+// single-character prefix check.
+func isDestructorName(name string) bool {
+	return strings.HasPrefix(name, "~")
+}
+
+// lastNamedIdentifier returns the content of the last identifier or
+// type_identifier among node's named children. Used by template parameter
+// extraction where the parameter name is the last token after `typename`,
+// `class`, or a type expression.
+func lastNamedIdentifier(node *sitter.Node, sourceCode []byte) string {
+	last := ""
+	for i := 0; i < int(node.NamedChildCount()); i++ {
+		c := node.NamedChild(i)
+		if c == nil {
+			continue
+		}
+		if c.Type() == "identifier" || c.Type() == "type_identifier" {
+			last = c.Content(sourceCode)
+		}
+	}
+	return last
+}
diff --git a/sast-engine/graph/parser_cpp_test.go b/sast-engine/graph/parser_cpp_test.go
new file mode 100644
index 00000000..f9595664
--- /dev/null
+++ b/sast-engine/graph/parser_cpp_test.go
@@ -0,0 +1,400 @@
+package graph
+
+import (
+	"testing"
+)
+
+// TestParseCppEndToEnd parses ../testdata/cpp/ as a complete project via
+// Initialize and asserts every C++-only node category is correctly
+// populated:
+//
+//   - Classes with inheritance (Dog : public Animal)
+//   - Pure virtual methods (Animal::speak = 0)
+//   - Virtual destructors (~Animal)
+//   - Override-marked methods (Dog::speak override)
+//   - Class data members (Dog::age)
+//   - Access specifier propagation (public/private)
+//   - Out-of-line method definitions (Dog::bark())
+//   - Namespaces (named and anonymous)
+//   - Templates (typename T)
+//   - Struct (C++ flavour, public default access)
+//   - Scoped enum (enum class)
+//   - Typedef
+//   - throw / try-catch
+//   - Method calls with arrow / dot / qualified shapes
+//   - Forward declarations in headers
+//
+// Runs end-to-end through Initialize() so the dispatcher in parser.go is
+// exercised alongside the parse functions in parser_cpp.go.
+func TestParseCppEndToEnd(t *testing.T) {
+	graph := Initialize("../testdata/cpp", nil)
+	if graph == nil {
+		t.Fatal("Initialize returned nil")
+	}
+
+	nodes := collectByType(graph)
+
+	t.Run("class_declarations_with_inheritance", func(t *testing.T) {
+		classes := nodes[nodeTypeClassDeclaration]
+		byName := indexByName(classes)
+
+		animal := byName["Animal"]
+		dog := byName["Dog"]
+		if animal == nil {
+			t.Fatal("expected class Animal")
+		}
+		if dog == nil {
+			t.Fatal("expected class Dog")
+		}
+
+		if dog.SuperClass != "Animal" {
+			t.Errorf("Dog.SuperClass = %q, want %q", dog.SuperClass, "Animal")
+		}
+		inheritance, _ := dog.Metadata[metaInheritance].([]string)
+		if len(inheritance) == 0 || inheritance[0] != "public Animal" {
+			t.Errorf("Dog inheritance = %v, want [public Animal]", inheritance)
+		}
+		if animal.Language != languageCpp {
+			t.Errorf("Animal.Language = %q, want %q", animal.Language, languageCpp)
+		}
+	})
+
+	t.Run("namespace_propagates_to_classes", func(t *testing.T) {
+		dog := indexByName(nodes[nodeTypeClassDeclaration])["Dog"]
+		if dog == nil {
+			t.Fatal("expected class Dog")
+		}
+		if dog.PackageName != "mylib" {
+			t.Errorf("Dog.PackageName = %q, want %q", dog.PackageName, "mylib")
+		}
+	})
+
+	t.Run("anonymous_namespace_has_no_name", func(t *testing.T) {
+		anon := false
+		for _, n := range nodes["namespace_definition"] {
+			if n.Name == "" {
+				anon = true
+				break
+			}
+		}
+		if !anon {
+			t.Error("expected at least one anonymous namespace_definition")
+		}
+	})
+
+	t.Run("method_declarations_with_access_and_override", func(t *testing.T) {
+		methods := nodes[nodeTypeMethodDeclaration]
+		var speakAnimal, speakDog, bark *Node
+		for _, m := range methods {
+			switch {
+			case m.Name == "speak" && hasMetadataBool(m, metaIsPureVirtual):
+				speakAnimal = m
+			case m.Name == "speak" && hasMetadataBool(m, metaIsOverride):
+				speakDog = m
+			case m.Name == "bark":
+				bark = m
+			}
+		}
+
+		if speakAnimal == nil {
+			t.Fatal("expected pure virtual speak in Animal")
+		}
+		if !hasMetadataBool(speakAnimal, metaIsPureVirtual) {
+			t.Error("Animal.speak should be marked pure virtual")
+		}
+		if !hasMetadataBool(speakAnimal, metaIsVirtual) {
+			t.Error("Animal.speak should be marked virtual")
+		}
+		if speakAnimal.Modifier != "public" {
+			t.Errorf("Animal.speak Modifier = %q, want %q", speakAnimal.Modifier, "public")
+		}
+
+		if speakDog == nil {
+			t.Fatal("expected Dog.speak override")
+		}
+		if !hasMetadataBool(speakDog, metaIsOverride) {
+			t.Error("Dog.speak should be marked override")
+		}
+
+		if bark == nil {
+			t.Fatal("expected private method bark")
+		}
+		if bark.Modifier != "private" {
+			t.Errorf("Dog.bark Modifier = %q, want %q", bark.Modifier, "private")
+		}
+	})
+
+	t.Run("destructor_recognised_as_method", func(t *testing.T) {
+		methods := nodes[nodeTypeMethodDeclaration]
+		found := false
+		for _, m := range methods {
+			if m.Name == "~Animal" {
+				found = true
+				if !hasMetadataBool(m, metaIsDestructor) {
+					t.Error("destructor should carry is_destructor metadata")
+				}
+				break
+			}
+		}
+		if !found {
+			t.Error("expected destructor ~Animal among method_declaration nodes")
+		}
+	})
+
+	t.Run("class_field_declaration", func(t *testing.T) {
+		fields := nodes[nodeTypeFieldDecl]
+		var age *Node
+		for _, f := range fields {
+			if f.Name == "age" {
+				age = f
+				break
+			}
+		}
+		if age == nil {
+			t.Fatal("expected field 'age' in Dog")
+		}
+		if age.DataType != "int" {
+			t.Errorf("age.DataType = %q, want %q", age.DataType, "int")
+		}
+		if age.Modifier != "public" {
+			t.Errorf("age.Modifier = %q, want %q", age.Modifier, "public")
+		}
+	})
+
+	t.Run("template_parameters_recorded", func(t *testing.T) {
+		templates := nodes["template_declaration"]
+		if len(templates) == 0 {
+			t.Fatal("expected at least one template_declaration node")
+		}
+		params, _ := templates[0].Metadata[metaTemplateParams].([]string)
+		if len(params) == 0 || params[0] != "T" {
+			t.Errorf("template params = %v, want [T]", params)
+		}
+	})
+
+	t.Run("throw_statement", func(t *testing.T) {
+		throws := nodes[nodeTypeThrowStmt]
+		if len(throws) == 0 {
+			t.Fatal("expected ThrowStmt node")
+		}
+		expr, _ := throws[0].Metadata[metaThrowExpr].(string)
+		if expr == "" {
+			t.Errorf("throw_expression should not be empty")
+		}
+	})
+
+	t.Run("try_statement_with_catch_types", func(t *testing.T) {
+		tries := nodes[nodeTypeTryStmt]
+		if len(tries) == 0 {
+			t.Fatal("expected TryStmt node")
+		}
+		catches, _ := tries[0].Metadata[metaCatchClauses].([]string)
+		if len(catches) == 0 || catches[0] == "" {
+			t.Errorf("expected at least one catch clause type, got %v", catches)
+		}
+	})
+
+	t.Run("call_expressions_with_shapes", func(t *testing.T) {
+		calls := nodes[nodeTypeCallExpression]
+		hasArrow, hasMethodDot, hasQualified := false, false, false
+		for _, c := range calls {
+			if hasMetadataBool(c, metaIsArrow) {
+				hasArrow = true
+			}
+			if hasMetadataBool(c, metaIsMethod) && !hasMetadataBool(c, metaIsArrow) {
+				hasMethodDot = true
+			}
+			if hasMetadataBool(c, metaIsQualified) {
+				hasQualified = true
+			}
+		}
+		// example.cpp uses d.speak() (dot) and identity<int>(42) (qualified-like)
+		// but no arrow call. We assert the shapes that appear.
+		if !hasMethodDot {
+			t.Error("expected at least one dot method call")
+		}
+		if !hasQualified {
+			t.Error("expected at least one qualified call (e.g., identity<int>())")
+		}
+		_ = hasArrow
+	})
+
+	t.Run("scoped_enum_marked", func(t *testing.T) {
+		enums := nodes[nodeTypeEnumDeclaration]
+		var color *Node
+		for _, e := range enums {
+			if e.Name == "Color" {
+				color = e
+				break
+			}
+		}
+		if color == nil {
+			t.Fatal("expected enum Color")
+		}
+		if v, _ := color.Metadata["is_scoped"].(bool); !v {
+			t.Error("Color should be marked is_scoped (declared as enum class)")
+		}
+	})
+
+	t.Run("typedef_recorded_with_cpp_language", func(t *testing.T) {
+		typedefs := nodes[nodeTypeTypeDefinition]
+		var alias *Node
+		for _, td := range typedefs {
+			if td.Name == "size_alias" {
+				alias = td
+				break
+			}
+		}
+		if alias == nil {
+			t.Fatal("expected typedef size_alias")
+		}
+		if alias.Language != languageCpp {
+			t.Errorf("size_alias.Language = %q, want %q", alias.Language, languageCpp)
+		}
+		if alias.DataType != "unsigned long" {
+			t.Errorf("size_alias.DataType = %q, want %q", alias.DataType, "unsigned long")
+		}
+	})
+
+	t.Run("struct_with_cpp_language", func(t *testing.T) {
+		structs := nodes[nodeTypeStructDeclaration]
+		var point *Node
+		for _, s := range structs {
+			if s.Name == "Point" {
+				point = s
+				break
+			}
+		}
+		if point == nil {
+			t.Fatal("expected struct Point")
+		}
+		if point.Language != languageCpp {
+			t.Errorf("Point.Language = %q, want %q", point.Language, languageCpp)
+		}
+		if len(point.MethodArgumentsType) != 2 {
+			t.Errorf("Point fields = %v, want 2 entries", point.MethodArgumentsType)
+		}
+	})
+
+	t.Run("forward_declarations_in_header", func(t *testing.T) {
+		// buffer.hpp declares Buffer class with constructor, destructor,
+		// and append() method as inline declarations. They should appear
+		// as method_declaration nodes with is_declaration=true.
+		methods := nodes[nodeTypeMethodDeclaration]
+		seen := map[string]bool{}
+		for _, m := range methods {
+			if m.File == "../testdata/cpp/buffer.hpp" {
+				seen[m.Name] = true
+			}
+		}
+		for _, want := range []string{"Buffer", "~Buffer", "append"} {
+			if !seen[want] {
+				t.Errorf("expected %q method declaration in buffer.hpp; saw %v", want, seen)
+			}
+		}
+	})
+
+	t.Run("regression_no_java_tagged_nodes_in_cpp_files", func(t *testing.T) {
+		for _, n := range graph.Nodes {
+			if n.File == "../testdata/cpp/example.cpp" || n.File == "../testdata/cpp/buffer.hpp" {
+				if n.Language != languageCpp {
+					t.Errorf("node %q (%s) in C++ file has Language=%q, want %q",
+						n.Name, n.Type, n.Language, languageCpp)
+				}
+			}
+		}
+	})
+}
+
+// TestParseCppClassSpecifier_ForwardDeclaration verifies the no-body
+// short-circuit so a forward `class Foo;` does not produce a phantom
+// class_declaration node.
+func TestParseCppClassSpecifier_ForwardDeclaration(t *testing.T) {
+	code := "class Foo;"
+	tree, root := parseSnippetForTest(t, code, true)
+	defer tree.Close()
+
+	cls := findFirstNodeOfType(root, "class_specifier")
+	if cls == nil {
+		t.Fatal("class_specifier not found")
+	}
+	g := NewCodeGraph()
+	if got := parseCppClassSpecifier(cls, []byte(code), g, "f.cpp", nil); got != nil {
+		t.Errorf("forward declaration should return nil, got %+v", got)
+	}
+	if len(g.Nodes) != 0 {
+		t.Errorf("forward declaration should not add nodes, got %d", len(g.Nodes))
+	}
+}
+
+// TestExtractCatchExceptionTypes_CatchAll covers the `catch (...)` shape
+// where tree-sitter emits the parameter list as the literal "(...)" with
+// no parameter_declaration child — extractCatchExceptionTypes should
+// emit "..." for that handler.
+func TestExtractCatchExceptionTypes_CatchAll(t *testing.T) {
+	code := `void f() { try { dangerous(); } catch (...) { recover(); } }`
+	tree, root := parseSnippetForTest(t, code, true)
+	defer tree.Close()
+
+	try := findFirstNodeOfType(root, "try_statement")
+	if try == nil {
+		t.Fatal("try_statement not found")
+	}
+	got := extractCatchExceptionTypes(try, []byte(code))
+	if len(got) != 1 || got[0] != "..." {
+		t.Errorf("got %v, want [...]", got)
+	}
+}
+
+// TestExtractTemplateParameters_Nil verifies the nil-list guard.
+func TestExtractTemplateParameters_Nil(t *testing.T) {
+	if got := extractTemplateParameters(nil, nil); got != nil {
+		t.Errorf("got %v, want nil", got)
+	}
+}
+
+// TestRecordAccessSpecifier_OutsideClassContext is a no-op — the access
+// specifier flows only when currentContext is a class node. This covers
+// the early-return path when recordAccessSpecifier is invoked with no
+// surrounding class (e.g., bug or future struct support that calls it
+// with a non-class context).
+func TestRecordAccessSpecifier_OutsideClassContext(t *testing.T) {
+	code := `class C { public: };`
+	tree, root := parseSnippetForTest(t, code, true)
+	defer tree.Close()
+
+	access := findFirstNodeOfType(root, "access_specifier")
+	if access == nil {
+		t.Fatal("access_specifier not found")
+	}
+
+	// Pass a non-class context (e.g., a function node) — should be a no-op.
+	notAClass := &Node{Type: nodeTypeFunctionDefinition, Metadata: map[string]any{}}
+	recordAccessSpecifier(access, []byte(code), notAClass)
+	if _, set := notAClass.Metadata[metaCurrentAccess]; set {
+		t.Error("recordAccessSpecifier should not mutate non-class context")
+	}
+}
+
+// indexByName returns a map of Name → Node for fast lookup. Names that
+// repeat across files keep whichever one comes first; tests that need
+// disambiguation should walk the slice directly instead.
+func indexByName(nodes []*Node) map[string]*Node {
+	out := make(map[string]*Node, len(nodes))
+	for _, n := range nodes {
+		if _, ok := out[n.Name]; !ok {
+			out[n.Name] = n
+		}
+	}
+	return out
+}
+
+// hasMetadataBool returns the bool value stored under key, defaulting to
+// false when absent or wrong type. Cleans up assertion code.
+func hasMetadataBool(n *Node, key string) bool {
+	if n == nil || n.Metadata == nil {
+		return false
+	}
+	v, _ := n.Metadata[key].(bool)
+	return v
+}
diff --git a/sast-engine/graph/testdata/c/buffer.h b/sast-engine/testdata/c/buffer.h
similarity index 100%
rename from sast-engine/graph/testdata/c/buffer.h
rename to sast-engine/testdata/c/buffer.h
diff --git a/sast-engine/graph/testdata/c/example.c b/sast-engine/testdata/c/example.c
similarity index 100%
rename from sast-engine/graph/testdata/c/example.c
rename to sast-engine/testdata/c/example.c
diff --git a/sast-engine/testdata/cpp/buffer.hpp b/sast-engine/testdata/cpp/buffer.hpp
new file mode 100644
index 00000000..ee891566
--- /dev/null
+++ b/sast-engine/testdata/cpp/buffer.hpp
@@ -0,0 +1,16 @@
+#pragma once
+
+namespace mylib {
+
+class Buffer {
+public:
+    Buffer();
+    ~Buffer();
+    void append(const char* data, std::size_t n);
+
+private:
+    char* data_;
+    std::size_t len_;
+};
+
+}  // namespace mylib
diff --git a/sast-engine/testdata/cpp/example.cpp b/sast-engine/testdata/cpp/example.cpp
new file mode 100644
index 00000000..2c091579
--- /dev/null
+++ b/sast-engine/testdata/cpp/example.cpp
@@ -0,0 +1,67 @@
+#include <iostream>
+#include <stdexcept>
+#include "buffer.hpp"
+
+namespace mylib {
+
+class Animal {
+public:
+    virtual void speak() = 0;
+    virtual ~Animal();
+};
+
+class Dog : public Animal {
+public:
+    void speak() override;
+    int age;
+private:
+    void bark();
+};
+
+void Dog::speak() {
+    std::cout << "woof" << std::endl;
+}
+
+void Dog::bark() {
+    speak();
+}
+
+Dog::~Animal() {
+}
+
+template<typename T>
+T identity(T v) {
+    return v;
+}
+
+void process(const std::string& msg) {
+    try {
+        if (msg.empty()) {
+            throw std::runtime_error("empty");
+        }
+        identity<int>(42);
+    } catch (const std::exception& e) {
+        std::cerr << e.what();
+    }
+}
+
+}  // namespace mylib
+
+namespace {
+int hidden_counter = 0;
+}
+
+struct Point {
+    int x;
+    int y;
+};
+
+enum class Color { Red, Green, Blue };
+
+typedef unsigned long size_alias;
+
+int main() {
+    mylib::Dog d;
+    d.speak();
+    return 0;
+}