Swift code for the 1billion row challenge

This is a swift implementation for the 1 billion row challenge on https://github.com/gunnarmorling/1brc .

The code is in https://github.com/pfy/1brc/blob/main/scanner/main.swift

The idea is the following:

MMAP the file without loading it
Split the file at newline boundaries for every core
For every split, run a block on every core with an operation queue
Get one accumulator dictionary per thread
inside the block, get raw byte access and iterate over every byte
find the semicolon, while accumulating a byte containing the first 8 bytes of the name
still finding the semicolon, after 8 bytes only calculate the hash
use a pointer to the city name, do not copy anything
after the semicolon, get the sign or the first number. the number is stored as int * 10. use the number parsing logic from https://github.com/dannyvankooten/1brc/blob/main/analyze.c#L39
accumulate numbers until we reach a newline, ignoring the point since the test datas always have one number after the point
generate a key, containing our hash as Int (Hashable needs an int ..)
find or update the statistics element for the city in our special hash. use the find function to get an index
merge the results from all threads
print the output

We use a special hashmap, which

can use a predefined hash value
can get us the index of a key or the next free element and supports insert by index

Results

(after warm, best of 3)

on my m1 pro laptop

      Model Name: MacBook Pro
      Model Identifier: MacBookPro18,2
      Model Number: Z14Y0007KSM/A
      Chip: Apple M1 Max
      Total Number of Cores: 10 (8 performance and 2 efficiency)
      Memory: 64 GB

./scanner measurements.txt  17.53s user 1.36s system 796% cpu 2.371 total

on a mac studio

      Model Name: Mac Studio
      Model Identifier: Mac13,2
      Model Number: Z14K0002CSM/A
      Chip: Apple M1 Ultra
      Total Number of Cores: 20 (16 performance and 4 efficiency)
      Memory: 64 GB

./scanner measurements.txt  19.40s user 2.55s system 1270% cpu 1.729 total

Name		Name	Last commit message	Last commit date
Latest commit History 41 Commits
scanner.xcodeproj		scanner.xcodeproj
scanner		scanner
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Swift code for the 1billion row challenge

Results

on my m1 pro laptop

on a mac studio

About

Releases

Packages

Languages

pfy/1brc

Folders and files

Latest commit

History

Repository files navigation

Swift code for the 1billion row challenge

Results

on my m1 pro laptop

on a mac studio

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages