Nutmeg Tokenizer

A standalone tokenizer for the Nutmeg programming language, implemented in Go.

Features

Tokenizes Nutmeg source code into JSON format
Supports all Nutmeg token types including:
- Numeric literals with multiple radixes (2-36)
- String literals with escape sequences
- Identifiers and keywords
- Operators with precedence information
- Delimiters with usage context
Command-line interface for file processing or stdin input
Comprehensive test suite

Installation

go build -o nutmeg-tokenizer ./cmd/nutmeg-tokenizer

Usage

Command Line

# Tokenize a file
./nutmeg-tokenizer examples/simple.nutmeg

# Read from stdin
echo "def hello end" | ./nutmeg-tokenizer -

# Show help
./nutmeg-tokenizer --help

As a Library

package main

import (
    "fmt"
    "github.com/spicery/nutmeg-tokenizer/pkg/tokenizer"
)

func main() {
    t := tokenizer.New("def hello(name) name end")
    tokens, err := t.Tokenize()
    if err != nil {
        panic(err)
    }

    for _, token := range tokens {
        fmt.Printf("%s: %s\n", token.Type, token.Text)
    }
}

Token Types

n - Numeric literals
s - String literals
S - Start tokens (def, if, while)
E - End tokens (end, endif, endwhile)
C - Compound tokens
L - Label tokens
P - Prefix tokens
V - Variable tokens
O - Operator tokens
[ - Open delimiters
] - Close delimiters
U - Unclassified tokens

Output Format

Each token is output as a JSON object with the following structure:

{
  "text": "def",
  "span": [1, 1, 1, 4],
  "type": "S",
  "closed_by": ["end"]
}

Testing

go test ./pkg/tokenizer

Examples

See the examples/ directory for sample Nutmeg code that demonstrates various token types.

Name		Name	Last commit message	Last commit date
Latest commit History 67 Commits
.github		.github
.tools		.tools
cmd/nutmeg-tokenizer		cmd/nutmeg-tokenizer
docs		docs
examples		examples
pkg/tokenizer		pkg/tokenizer
.gitignore		.gitignore
.goreleaser.yml		.goreleaser.yml
CLAUDE.md		CLAUDE.md
Dockerfile		Dockerfile
Justfile		Justfile
README.md		README.md
go.mod		go.mod
go.sum		go.sum

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Nutmeg Tokenizer

Features

Installation

Usage

Command Line

As a Library

Token Types

Output Format

Testing

Examples

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Nutmeg Tokenizer

Features

Installation

Usage

Command Line

As a Library

Token Types

Output Format

Testing

Examples

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages