Skip to content

Commit

Permalink
Add some more test cases for tokenization and ascii folding
Browse files Browse the repository at this point in the history
  • Loading branch information
SiarheiFedartsou committed Dec 7, 2024
1 parent 5c90483 commit f038644
Showing 1 changed file with 2 additions and 0 deletions.
2 changes: 2 additions & 0 deletions integration/analyzer_peliasQuery.js
Original file line number Diff line number Diff line change
Expand Up @@ -14,7 +14,9 @@ module.exports.tests.analyze = function(test, common){
suite.action( function( done ){ setTimeout( done, 500 ); }); // wait for es to bring some shards up

assertAnalysis('tokenizer', 'foo-bar baz/42', ['foo','bar','baz','42']);
assertAnalysis('tokenizer', 'foo-bar baz/42', ['foo','bar','baz','42']); // tab instead of space
assertAnalysis('tokenizer', 'foo---bar baz/42', ['foo','bar','baz','42']);
assertAnalysis('tokenizer', 'foo—bar baz/42', ['foobar','baz','42']); // dash is not a hyphen
assertAnalysis('tokenizer', 'foo-bar baz//42', ['foo','bar','baz','42']);
assertAnalysis('tokenizer', 'foo bar baz 42', ['foo','bar', 'baz', '42']);
assertAnalysis('tokenizer', 'foo-bar baz\\42', ['foo', 'bar','baz', '42']);
Expand Down

0 comments on commit f038644

Please sign in to comment.