Back to basics NCAA march madness basketball pool software implemented in pure C for speed. No nonsensical bit twiddling. 40.2M bracket scores per second.
Implemented as a single file header for command line pool management. This may change at some point, not sure what benefit this has as the use case I was thinking of no longer applies.
Run
$ make
$ ./pool -hUses the value of CC environment variable, or cc by default (clang on Darwin).
Use clang explicitly:
$ CC=clang makeTested on Linux Mint, Ubuntu, Windows 11 (MinGW), Windows 11 (WSL) and Mac OS X (M1).
You can use this software to run your office or friend group NCAA pool.
- Create a directory on your computer to hold the pool files.
- Create the
config.txtfile and set your pool's name, scoring method and round scores.- If you are running this pool for profit, also fill in the
feeandpayoutslines. - The
feeis the number of currency units you collect per entry. - The
payoutsline is a comma separated list of no more than 4 integers which specify the payouts by rank. The first 3 values pay out 1st, 2nd, and 3rd place. The 4th value pays out last place.- positive numbers specify a percentage of the total fees collected
- -1 specifies that place gets the entry fee back.
- The sum of the percentages has to equal exactly 100
- Sample
config.txtfile
- If you are running this pool for profit, also fill in the
- Create the
teams.txtfile in the directory.- Each line is in the form "Long Name,3-character Short Name". Any extra fields (like seed number) are ignored.
- Each team must have a unique short name.
- The short names are case sensitive.
- For the four play in games, use
PI1,PI2,PI3andPI4as the short name. The long name is irrelevant. - The order of teams listed is as follows: top left region first, bottom left region second, top right region third, bottom right region fourth. The left side plays the right side in the championship. Within each region, teams are in play order by seed as follows: 1, 16, 8, 9, 5, 12, 4, 13, 6, 11, 3, 14, 7, 10, 2, 15
- Lines starting with a
#character are considered comments and ignored - Optionally, include exactly 4
Region: Namelines to name the four regions (e.g.Region: East). They can appear anywhere in the file. If omitted, regions are labeled Region 1 through Region 4 in the teams report. - Sample
teams.txtfile
- Collect entries
- You can hand out forms, set up a pool on one of the free websites or send out the included bracket HTML entry collector page via email.
- The HTML entry collector needs to be edited with the current year's
teams.
- New for 2024: there is a ruby script to generate the web entry collector
from the teams file. You need a working ruby install and can run it as
follows:
ruby web/make_bracket.rb ./2026/teams.txt ./web/2026_bracket.html ./2026/ncaa_2026.logo- 1st argument: the location of the generator script in this repo
- 2nd argument: the location of the teams file
- 3rd argument: location where to save the bracket
- 4th argument: the location of a logo image file in urldata format.
- NOTE: The teams file MUST contain exactly 4
Region: Namelines for the bracket generator to label the regions correctly. See the sample teams.txt file above for correct format.
- New for 2024: there is a ruby script to generate the web entry collector
from the teams file. You need a working ruby install and can run it as
follows:
- Here is what the HTML entry collector looks like: Latest NCAA Tournament Bracket
- Create the
entriessubfolder and copy/create/save any entry files you got from the pool entrants. - Generate an entries report and send it out to everyone for confirmation:
./pool -d mypool entries - Handle the play in games.
- As the play in games are played, add to
the pool
config.txtfile lines that map the place holder short team names (PI[1234]) with a new short name matching the winner. You will end up with lines like:PI1=Wag PI2=Cdo PI3=GrS PI4=CSt - Make sure to change the long name of the play in teams in the
teams.txtfile, but don't modify thePI[1234]short names.
- As the play in games are played, add to
the pool
- As games are played in the tournament, record the winners in the
results.txtfile, one team short name per line. For play in games, you can use either the originalPI[1234]short names or the mapped short names of the play in games from the step above. - Run the scores report:
./pool -d mypool scoresuntil the first round is complete. - Run your first possibilities report as soon as your machine
can handle it:
./pool -d mypool -p possand pass around the results via email, slack, discord or whatever. - When the final four teams are determined, run the Final Four
report to show all top 4 standings and payouts (if configured) for
each of the remaining possibilities:
./pool -d mypool ffour
The pool configuration specifies the scorer to use in the pool.
Each scorer uses as input the bracket being scored, the winning
team number, the losing team number, the round number and the
game number. In addition to the scorer, the pool configuration
specifies 6 round scores that the scorer can use. The default
round scores are 1, 2, 4, 8, 16, 32. To override the default,
supply a roundScores configuration option in the config.txt
file:
roundScores=1,2,4,8,12,24
These are the supported scorers:
- Basic: each correct pick is worth a constant amount - the round round score configured for that round.
- Upset: each correct pick is worth the round score for the game’s round plus the seed number of the winning team.
- SeedDiff: each correct pick is worth the round score for the game’s round plus the difference in seeds of the winning team and the losing team. The bonus points only apply if the loser was picked correctly and the winner’s seed is greater than the loser’s seed.
- RelaxedSeedDiff: each correct pick is worth the round score for the game’s round plus the difference in seeds of the winning team and the losing team. The bonus points only apply if winner’s seed is greater than the loser’s seed. This differs from SeedDiff in that the loser in the game need not have been picked correctly.
- JoshP: each correct pick is worth the round score for the game's round multiplied by the seed number of the winning team.
- UpsetMultiplier: each correct pick is worth the round score for the game's round multiplied by the winner's seed divided by the loser's seed when the winner's seed is greater (an upset). Otherwise it is worth just the round score. The actual opponent is used, like RelaxedSeedDiff.
M1 mac can analyze 2.147B possible outcomes against 50 entries in 37 minutes once there are 32 teams remaining.
Supercalifragilistic Pool: Possibilities Report
There are 32 teams and 31 games remaining, 2.147B possible outcomes
DFS BPS: 968207 100% ELAPSED: 00:36:58
Min Max Curr Max Win Times Times
Name Rank Rank Score Score Chance Won Tied Top Champs
entry23 1 29 193 497 17.93 385M 11.17M Cat,foe,wil,ane,hea
entry26 1 36 169 521 15.94 342.3M 8.821M mor,cir,mas,ten,gri
entry30 1 29 184 474 14.92 320.3M 11.19M slu,dis,dee,mus,hea
entry32 1 31 184 420 7.61 163.5M 6.399M lan,pos,dur,wid,sho
entry33 1 38 161 460 6.53 140.3M 5.080M fan,cla,ove,can,ane
entry7 1 37 165 444 5.13 110M 5.036M lan,aba,ext,spl,Cha
entry3 1 42 150 460 4.70 100.9M 4.543M dum,can,ten,ane,pos
entry9 1 33 169 439 4.43 95.10M 4.895M Cha,mus,ane,Wis,can
entry4 1 31 180 396 4.26 91.41M 4.098M moo,spl,ext,wid,aba
entry14 1 41 165 406 4.24 91.14M 4.179M dee,dum,mas,gri,Wis
entry47 1 44 155 380 2.57 55.11M 2.515M wil,mas,wid,pos,gri
entry50 1 36 167 423 2.11 45.27M 2.912M Cha,sho,aba,rap,pos
entry28 1 44 147 438 1.66 35.60M 1.718M mus,cir,lan,rap,gri
entry21 1 42 154 403 1.55 33.21M 1.777M mus,hea,spl,Wis,rap
entry19 1 41 158 363 1.20 25.79M 1.701M cla,spl,wid,sho,gri
entry38 1 40 161 368 1.19 25.66M 1.612M rap,wil,sho,spl,wid
entry6 1 48 121 423 0.82 17.61M 1.025M ten,ext,dur,foe,cir
entry13 1 48 128 352 0.40 8.512M 512.9K cir,Wis,pin,ove,dum
entry20 1 44 146 333 0.31 6.577M 587.7K dee,pin,ten,cla,ove
entry10 1 43 150 345 0.24 5.236M 477K sho,cla,ext,cir,aba
entry36 1 49 119 392 0.12 2.644M 213.4K aba,dur,dis,wid,spl
entry16 1 44 146 353 0.10 2.212M 208.7K moo,foe,mas,Wis,ext
entry41 1 40 155 302 0.06 1.254M 186.9K gri,aba,wid,pos,spl
entry29 1 50 102 303 0.05 1.042M 86.31K mas,spl,ext,aba,pos
entry37 1 48 133 335 0.03 630.7K 107.5K ten,spl,aba,gri,foe
entry46 1 50 122 320 0.02 497.2K 62.95K mus,dur,ten,mas,gri
entry8 1 50 101 311 0.01 232.3K 23.14K rap,pos,pin,can,gri
entry31 1 50 100 318 0.01 166.9K 20.40K moo,ane,Wis,ext,wil
entry34 1 50 124 272 0.00 46.97K 11.22K wid,mas,pin,ove,gri
entry12 1 50 103 267 0.00 2720 752 rap,ane,ext
entry27 1 50 122 255 0.00 29 64 cir,gri,rap,dur,ove
entry17 2 50 114 291 0.00 0 0
entry1 2 48 136 289 0.00 0 0
entry43 2 49 128 284 0.00 0 0
entry24 2 46 140 271 0.00 0 0
entry42 2 49 119 258 0.00 0 0
entry40 3 50 70 253 0.00 0 0
entry15 4 50 112 253 0.00 0 0
entry2 3 50 101 247 0.00 0 0
entry48 6 50 103 236 0.00 0 0
entry35 3 50 91 234 0.00 0 0
entry18 6 50 103 225 0.00 0 0
entry49 8 50 103 219 0.00 0 0
entry45 6 49 112 210 0.00 0 0
entry25 10 50 111 201 0.00 0 0
entry11 10 50 106 199 0.00 0 0
entry22 16 50 104 184 0.00 0 0
entry5 13 50 84 184 0.00 0 0
entry39 18 50 104 177 0.00 0 0
entry44 27 50 91 152 0.00 0 0
We want to generate all possible outcomes and score all the entries against them. Set up a DFS with inputs:
- games - array of game numbers remaining
- gamesLeft - size of this array
- game - game in games array under review
- stats - array of stats structs, one per entry
- maxRank, minRank, maxScore, timesWon, timesTied, possibleScore, champCounts[], bracket
- possibleScore is set to current bracket score given state of tournament so far
Algo:
- Initial setup as above
- Determine the two teams in games[game]
- Assume winner is the first team, calculate gameScore for each entry, add to possibleScore = possibleScore + gameScore
- Recurse with game += 1. If game == gamesLeft, sort stats by possiblScore and update stats for this result, stop recursion.
- Subtract the gameScore from step 2 from each possibleScore of each entry
- Repeat 2-5 with winner assumed to be second team
max recursion depth would be = number of games remaining
We can parallelize the above to a power of 2 processes/servers/cores/etc if the current tournament bracket is advanced and the algo run from there, then results collected and combined.
2 runners:
- 1st runner assumes game 1 winner is team1
- 2nd runner assumes game 1 winner is team2
4 runners:
- 1st runner assumes game 1 winner is team1, game 2 winner is team1
- 2nd runner assumes game 1 winner is team2, game 2 winner is team1
- 3nd runner assumes game 1 winner is team1, game 2 winner is team2
- 4th runner assumes game 1 winner is team2, game 2 winner is team2
8 runners:
- 1st runner assumes game 1 winner is team1, game 2 winner is team1, game 3 winner is team1
- 2nd runner assumes game 1 winner is team2, game 2 winner is team1, game 3 winner is team1
- 3nd runner assumes game 1 winner is team1, game 2 winner is team2, game 3 winner is team1
- 4th runner assumes game 1 winner is team2, game 2 winner is team2, game 3 winner is team1
- 5th runner assumes game 1 winner is team1, game 2 winner is team1, game 3 winner is team2
- 6th runner assumes game 1 winner is team2, game 2 winner is team1, game 3 winner is team2
- 7th runner assumes game 1 winner is team1, game 2 winner is team2, game 3 winner is team2
- 8th runner assumes game 1 winner is team2, game 2 winner is team2, game 3 winner is team2
etc
Example run with 8 processes, collecting results into binary files in the pool directory:
./pool -d test/fifty_entries -b 0 -n 8 -f bin poss &
./pool -d test/fifty_entries -b 1 -n 8 -f bin poss &
./pool -d test/fifty_entries -b 2 -n 8 -f bin poss &
./pool -d test/fifty_entries -b 3 -n 8 -f bin poss &
./pool -d test/fifty_entries -b 4 -n 8 -f bin poss &
./pool -d test/fifty_entries -b 5 -n 8 -f bin poss &
./pool -d test/fifty_entries -b 6 -n 8 -f bin poss &
./pool -d test/fifty_entries -b 7 -n 8 -f bin poss &When -f bin is used, a binary file with partial stats is written to the pool directory.
The partial stats file names are poss_N_of_M.bin.
After producing the files (possibly on different machines), collect them into the pool directory, then generate the possibilities report:
./pool -d test/fifty_entries -r possUsing this method and a compute cloud of 4,096 Spot instances, we can generate a possibilities
report after Day 1 is complete in about 4 hours at a cost of roughly $280. See aws/ for a
complete implementation using AWS Batch.
On a single machine (M1 Mac), run one process per core using parallel.sh (set PROCS to your
core count). At 2.3M brackets/sec per process, a 12-core machine can generate the report
in under 1 hour once there are 36 games remaining (about 37 teams), taking roughly 41
minutes. With fewer cores the threshold is lower — for N cores it is when
2^(games remaining) / (N × 2,300,000) falls under 3,600 seconds.
When there are more than ~36 games remaining (e.g. immediately after Day 1 with
31 games left), the exhaustive DFS is not practical on a single machine. The mc
command runs a fast Monte Carlo simulation instead: it generates N random tournament
completions, scores all entries against each one, and reports win probabilities with
statistical error margins.
Game outcomes are sampled using one of two selection modes, controlled by the -m
flag.
-m seed (default) — Seed-weighted probability. For a matchup between seed S1
and seed S2, the probability that the lower-seeded team wins is S2 / (S1 + S2)
(e.g. a 1 vs 16 gives the 1-seed a ~94% chance; a 5 vs 12 gives ~71%). Simple and
requires no extra data, but is a rough approximation.
-m model — Statistical model weighted by 24 team statistics sourced from
algebracket.com. Each team is scored by computing a
weighted sum of its normalized statistics; the win probability for a matchup is
score1 / (score1 + score2). This requires two additional files in the pool
directory:
-
weights.json— a JSON object mapping each of the 24 stat IDs to its weight (0–10). Use your own slider values from algebracket.com to reflect your own prediction preferences. The JSON key for each slider is derived from the stat name by removing lowercase letters, spaces, and punctuation (replacing%withP):JSON key Stat name JSON key Stat name SeedSeed OTSPOpp. True Shoot % WPWin % PPace SSSoS TPTurnover % PGPts / Gm OTPOpp. Turnover % OPGOpp Pts / Gm TMTurnover Margin FGPFG % APAssist % 3PFGP3Pt FG % ATAssists / Turnover FTPFree Throw % FTFGAFT / FGA OROffense Rating OFTFGAOpp. FT / FGA DRDefense Rating RPRebound % ASMAdj. Score Margin ORPOff. Rebound % EFGPEffective FG % TSPTrue Shooting % -
stats.csv— per-team statistics in the algebracket CSV format (one row per team, normalized stats in columns). The file must begin with front-matter comment lines that map each anglebracket region integer to its pool region name, so teams can be matched by (region, seed) rather than by name:
# region 0: East
# region 1: South
# region 2: West
# region 3: Midwest
Rank,Name,Games Won,Region,Seed,Win %,SoS,...
./pool -d mypool -s 1000000 mc # seed-weighted (default)
./pool -d mypool -s 1000000 -m model mc # algebracket modelWith 1,000,000 simulations the worst-case statistical margin of error is ±0.1% at
95% confidence. Runtime is a few seconds on any modern machine. The -p flag shows
progress, and -f json outputs JSON. The -s flag controls sample count (default
1,000,000).
The output table mirrors the poss report format with an inline ±margin on the
Win% column showing the statistical uncertainty for each entry's win probability.
Included in the repo is a ruby script that will generate a HTML status report page that you can either host on a web server or send around for opening in a web browser. It is completely stand alone. To create it run the ruby web/status/generate.rb script. The script is likewise completely stand alone, requiring only Ruby 3.x and relying on no third party gems. You can run it at any time, including before the tournament starts. It will generate a history of the leader board and possibilities as the tournament progresses. You should run it before the tourny begins, after each day in the first weekend, and then after the Sweet Sixten, Elite Eight, Final Four and Championship game. It is ok to run it in between these milestones if you want to.
See a sample here: HTML Status Report
To run:
ruby ./web/status/generate.rb -d <POOL_DIR> -o path/to/status.htmlGame numbers by round and what game numbers teams will play in as they advance. Read across and up.
Game indexes
Round
Seeds Teams 1 2 3 4 5 Championship
1-16 t1 t2: 0 32--48--56--60--62
8-9 t3 t4: 1--+ | | | |
5-12 t5 t6: 2 33---+ | | |
4-13 t7 t8: 3--+ | | |
6-11 t9 t10: 4 34--49 --+ | |
3-14 t11 t12: 5--+ | | |
7-10 t13 t14: 6 35---+ | |
2-15 t15 t16: 7--+ | |
1-16 t17 t18: 8-36--50--57---+ |
8-9 t19 t20: 9 |
5-12 t21 t22: 10 37 |
4-13 t23 t24: 11 |
6-11 t25 t26: 12 38 51 |
3-14 t27 t28: 13 |
7-10 t29 t30: 14 39 |
2-15 t31 t32: 15 |
|
1-16 t33 t34: 16 40 52 58 61---+
8-9 t35 t36: 17
5-12 t37 t38: 18 41
4-13 t39 t40: 19
6-11 t41 t42: 20 42 53
3-14 t43 t44: 21
7-10 t45 t46: 22 43
2-15 t47 t48: 23
1-16 t49 t50: 24 44 54 59
8-9 t51 t52: 25
5-12 t53 t54: 26 45
4-13 t55 t56: 27
6-11 t57 t58: 28 46 55
3-14 t59 t60: 29
7-10 t61 t62: 30 47
2-15 t63 t64: 31