Extreme amount of incorrect language detected #7088
-
For some odd reason, the language on my TinyTapeout Repository https://github.com/gfcwfzkm/tt09-scope-bfh-mht1_3 is detected heavily as F# - despite containing no code with F#. It is so extreme, that my whole githup account statistics show now F# as the dominant language. On top of that, in the same repo, a considerable amount of Verilog is detected, yet when searched for it, only 6 little files contain Verilog at all. Any idea how I can fix that? |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 1 reply
-
The reason you are seeing this is because of these two files:
The linguist/lib/linguist/heuristics.yml Lines 277 to 286 in f164d13 If this doesn't match, things fall through to the classifier which makes a best effort guess based on the samples we have, which in this case isn't the right guess. Both of these files are pretty large at 5.7MB each so nearly 12MB of your entire repo is considered F#. These won't appear in the search results because of their size (the number in the second column is in bytes): $ github-linguist .
93.21% 11948739 F#
3.83% 490822 HTML
1.75% 223885 Verilog
1.18% 151042 VHDL
0.01% 1216 Tcl
0.01% 1103 Makefile
0.01% 1020 Dockerfile
0.01% 755 Stata
0.01% 679 Python
0.00% 106 Shell
$
This would be due to these file:
... with
You can use an override. |
Beta Was this translation helpful? Give feedback.
The reason you are seeing this is because of these two files:
The
.fs
extension is associated with multiple languages so we rely on this heuristic to try and determine the language:linguist/lib/linguist/heuristics.yml
Lines 277 to 286 in f164d13