One Billion Row Challenge

This is my implementation of the One Billion Row Challenge - a performance challenge to process a large dataset of weather measurements as quickly as possible.

How to Run

Clone the challenge repository: git clone git@github.com:gunnarmorling/1brc.git
Make sure to have Java installed and then run:

    ./mvnw clean verify
    ./create_measurements.sh 1000000000

Run the program: go run main.go

Optimizations

Map-reduce approach with parallel workers
Efficient chunk-based file reading - instead of reading line by line, we read in chunks of 4MB
Process the data as bytes instead of strings to avoid string allocations
Custom line splitting instead of using string.Split
Custom temperature parsing function instead of standard string conversion
Optimized city name deduplication using integer indices instead of string keys

Architecture

                                  ┌─────────────┐
                                  │   Monitor   │
                                  │  Goroutine  │
                                  └─────────────┘
                                        │
                                        ▼
┌─────────────┐    ┌───────────┐    ┌─────────┐    ┌───────────────┐
│  File Reader│───▶│ publishCh │───▶│ Workers │───▶│ Results Channel│
│  Goroutine  │    └───────────┘    │(Map)    │    └───────────────┘
└─────────────┘                     └─────────┘            │
                                                           ▼
                                                   ┌───────────────┐
                                                   │ Reduce Phase  │
                                                   │ (Main Thread) │
                                                   └───────────────┘
                                                           │
                                                           ▼
                                                   ┌───────────────┐
                                                   │ Save Results  │
                                                   └───────────────┘

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
monitor		monitor
.gitignore		.gitignore
README.md		README.md
go.mod		go.mod
main.go		main.go

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

One Billion Row Challenge

How to Run

Optimizations

Architecture

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

One Billion Row Challenge

How to Run

Optimizations

Architecture

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages