Improve Git & Maven config #1

Merged: gunnarmorling merged 3 commits into gunnarmorling:main on Dec 29, 2023
Conversation

@nipafx (Contributor) commented Dec 29, 2023

No description provided.

@gunnarmorling gunnarmorling merged commit d53b3aa into gunnarmorling:main Dec 29, 2023
1 check passed
@gunnarmorling (Owner) commented:

Very nice, thanks a lot!

Jesse-Van-Rooy added 3 commits to Jesse-Van-Rooy/1brc that referenced this pull request on Jan 11, 2024
gunnarmorling pushed a commit that referenced this pull request Jan 14, 2024
* Submission #1

* Submission #1 (Fixed casing of file names)

* Submission #1 (Added executable to Git permissions; see the note after this list)

* Submission 1 (Fixed incorrect map size)

* Submission 1 (Fixed output problems on Windows)
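On the executable-permission fix above: the usual way to set the executable bit on an already-tracked file is git update-index --chmod=+x <file> (or chmod +x on disk, which Git records when core.fileMode is enabled). The exact files changed are not shown in this log.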
vemana added a commit to vemana/1brc that referenced this pull request Jan 17, 2024
Commit gunnarmorling#4: Parallelize munmap() and reduce completion time further by

10%. As the JVM exits with the exit(0) syscall, the kernel reclaims the
memory mappings via munmap() calls. Prior to this change, all the munmap()
calls were happening right at the end as the JVM exited. This led to
serial execution of about 350 ms out of 2500 ms right at the end after
each shard completed its work. We can parallelize it by exposing the
Cleaner from MappedByteBuffer and then ensuring truly parallel
execution of munmap() by using a non-blocking lock (SeqLock). The
optimal strategy for when each thread must call munmap() is an interesting
math problem with an exact solution, and this code roughly reflects it.
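
For context, a minimal sketch of unmapping a MappedByteBuffer eagerly from Java. This variant relies on sun.misc.Unsafe.invokeCleaner (JDK 9+) rather than whatever mechanism the commit uses to expose the Cleaner, and the class and method names are illustrative:

    import java.lang.reflect.Field;
    import java.nio.MappedByteBuffer;

    final class EagerUnmap {
        private static final sun.misc.Unsafe UNSAFE;
        static {
            try {
                Field f = sun.misc.Unsafe.class.getDeclaredField("theUnsafe");
                f.setAccessible(true);
                UNSAFE = (sun.misc.Unsafe) f.get(null);
            } catch (ReflectiveOperationException e) {
                throw new ExceptionInInitializerError(e);
            }
        }

        // Trigger munmap() now rather than leaving all unmapping to JVM exit.
        // The buffer must not be accessed after this call.
        static void unmap(MappedByteBuffer buffer) {
            UNSAFE.invokeCleaner(buffer);
        }
    }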

Commit gunnarmorling#3: Tried out reading a long at a time from the ByteBuffer
and checking for the presence of ';'; it was slower than just reading int().
Removed the code for reading longs, retaining only the
hasSemicolonByte(..) check code.
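
The retained check reads like the classic SWAR zero-byte test. A sketch of what a hasSemicolonByte(..) of this kind usually looks like (the commit's exact code may differ):

    final class SemicolonCheck {
        private static final long SEMIS = 0x3B3B3B3B3B3B3B3BL; // ';' (0x3B) in all 8 bytes

        // True if any of the 8 bytes in 'word' equals ';'. The XOR zeroes a
        // matching byte; the (x - 0x01...) & ~x & 0x80... test detects a zero byte.
        static boolean hasSemicolonByte(long word) {
            long x = word ^ SEMIS;
            return ((x - 0x0101010101010101L) & ~x & 0x8080808080808080L) != 0;
        }
    }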

Commit gunnarmorling#2: Introduce processLineSlow() and processRangeSlow() for the
tail part.

Commit gunnarmorling#1: Create a separate tail piece of work for the last few lines to be
processed separately from the main loop. This allows the main loop to
read past its allocated range (by a 'long', if we reserve at least 8 bytes
for the tail piece of work).
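
A sketch of that split, with hypothetical processRangeFast/processRangeSlow stubs and ignoring the newline alignment the real code also has to handle:

    final class ChunkSplit {
        static final long TAIL_RESERVE = 8; // fast loop may over-read by one long

        // Split [start, end) so the fast loop never reads past 'end' even when
        // it over-reads mainEnd by up to 8 bytes; the tail takes the slow path.
        static void processChunk(long start, long end) {
            long mainEnd = Math.max(start, end - TAIL_RESERVE);
            processRangeFast(start, mainEnd);
            processRangeSlow(mainEnd, end);
        }

        static void processRangeFast(long from, long to) { /* hypothetical fast loop */ }
        static void processRangeSlow(long from, long to) { /* hypothetical byte-at-a-time loop */ }
    }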
gunnarmorling pushed a commit that referenced this pull request Jan 17, 2024
…m 16th based on local testing; no Unsafe; no bitwise tricks yet (#465)

* Squashing a bunch of commits together.

Commit #2: Uplift of 7% using native byte order from ByteBuffer.
Commit #1: Minor changes to formatting.
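
The byte-order uplift presumably amounts to a change along these lines (a sketch; the buffer setup is assumed):

    import java.nio.ByteBuffer;
    import java.nio.ByteOrder;

    class NativeOrder {
        public static void main(String[] args) {
            // ByteBuffer defaults to BIG_ENDIAN; on little-endian CPUs (x86, most ARM),
            // using the native order avoids a byte swap on every getLong/getInt.
            ByteBuffer buf = ByteBuffer.allocateDirect(64).order(ByteOrder.nativeOrder());
            buf.putLong(0, 42L);
            System.out.println(buf.getLong(0)); // read back in native order
        }
    }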

* Commit #4: Parallelize munmap() and reduce completion time further by
10%. As the JVM exits with the exit(0) syscall, the kernel reclaims the
memory mappings via munmap() calls. Prior to this change, all the munmap()
calls were happening right at the end as the JVM exited. This led to
serial execution of about 350 ms out of 2500 ms right at the end after
each shard completed its work. We can parallelize it by exposing the
Cleaner from MappedByteBuffer and then ensuring truly parallel
execution of munmap() by using a non-blocking lock (SeqLock). The
optimal strategy for when each thread must call munmap() is an interesting
math problem with an exact solution, and this code roughly reflects it.
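
The SeqLock itself is not shown in the commit message; for flavor, a minimal Java sequence-lock sketch (illustrative only, not the commit's implementation):

    import java.util.concurrent.atomic.AtomicLong;

    // Writers flip the counter odd -> even around their critical section;
    // readers retry if they saw an odd value or the counter changed mid-read.
    final class SeqLock {
        private final AtomicLong seq = new AtomicLong();

        long beginWrite() {
            long s;
            do { s = seq.get(); } while ((s & 1L) != 0 || !seq.compareAndSet(s, s + 1));
            return s + 1;
        }
        void endWrite(long token) { seq.set(token + 1); }

        long beginRead() {
            long s;
            while (((s = seq.get()) & 1L) != 0) Thread.onSpinWait();
            return s;
        }
        boolean mustRetry(long token) { return seq.get() != token; }
    }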

Commit #3: Tried out reading a long at a time from the ByteBuffer
and checking for the presence of ';'; it was slower than just reading int().
Removed the code for reading longs, retaining only the
hasSemicolonByte(..) check code.

Commit #2: Introduce processLineSlow() and processRangeSlow() for the
tail part.

Commit #1: Create a separate tail piece of work for the last few lines to be
processed separately from the main loop. This allows the main loop to
read past its allocated range (by a 'long', if we reserve at least 8 bytes
for the tail piece of work).
gunnarmorling pushed a commit that referenced this pull request Jan 28, 2024
* Latest snapshot (#1)

preparing initial version

* Improved performance to 20 seconds (-9 seconds from the previous version) (#2)

improved performance a bit

* Improved performance to 14 seconds (-6 seconds) (#3)

improved performance to 14 seconds

* sync branches (#4)

* initial commit

* some refactoring of methods

* some fixes for partitioning

* some fixes for partitioning

* fixed hacky getcode for utf8 bytes

* simplified getcode for partitioning

* temp solution with syncing

* temp solution with syncing

* new stream processing

* new stream processing

* some improvements

* cleaned stuff

* run configuration

* round buffer for the stream to pages

* not using compute since it's slower than straightforward get/put. using own byte array equals.

* using parallel gc

* avoid copying bytes when creating a station object

* formatting

* Copy less arrays. Improved performance to 12.7 seconds (-2 seconds) (#5)

* initial commit

* some refactoring of methods

* some fixes for partitioning

* some fixes for partitioning

* fixed hacky getcode for utf8 bytes

* simplified getcode for partitioning

* temp solution with syncing

* temp solution with syncing

* new stream processing

* new stream processing

* some improvements

* cleaned stuff

* run configuration

* round buffer for the stream to pages

* not using compute since it's slower than straightforward get/put. using own byte array equals. (a sketch of this get/put pattern follows after this list)

* using parallel gc

* avoid copying bytes when creating a station object

* formatting

* some tuning to increase performance

* some tuning to increase performance

* avoid copying data; fast hashCode with slightly more collisions

* avoid copying data; fast hashCode with slightly more collisions

* cleanup (#6)

* tidy up
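
On the "not using compute" bullets above, the get-then-put pattern presumably looks like this (Key and Aggregate are placeholders; the real code also brings its own byte-array equality):

    import java.util.HashMap;

    final class GetPutSketch {
        static final class Key { byte[] name; }   // placeholder station key
        static final class Aggregate { }          // placeholder per-station stats

        // Plain get-then-put, reported faster here than HashMap.compute(),
        // which invokes a remapping function on every call.
        static Aggregate lookup(HashMap<Key, Aggregate> map, Key key) {
            Aggregate agg = map.get(key);
            if (agg == null) {
                agg = new Aggregate();
                map.put(key, agg);
            }
            return agg;
        }
    }

The "using parallel gc" bullet corresponds to running the JVM with -XX:+UseParallelGC.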