fuzzy matching rewrite using new function get_matching_blocks() #827

friday · 2021-10-12T14:11:00Z

Replaces #824.

I wasn't planning to, but I reimplemented this for the v6 branch instead, with the major difference that it uses a new method get_matching_blocks() which only exists in this branch (because it doesn't behave exactly the same in some cases).

Was finally able to replace those last 30 lines of code to now to a much more sensible solution thanks to get_matching_blocks(), and also removed the condition #824 reintroduced.

The character weighting is much smaller, and the new code doesn't bump up everything to 100% like the old one did with very short queries.

The highlighter and matcher uses the same implementation now (even the same function call, since it's memoized). Before this, the highligher used a completely different implementation and sometimes would display misleading results.

Might need some need tuning still, but I'm happy with the results, because getting the matchmaking blocks means we can understand it better and score it better, with less code duplication.

troycurtisjr · 2021-10-16T13:15:41Z

Oh yeah this is really nice. The implementation is much improved from a code/logic standpoint, and the results are right on. I haven't been able to find any cases yet where the results did not line up with my expectations! 👍 👍

As far as I'm concerned this is ready to merge into v6.

friday · 2021-10-16T15:59:04Z

Oh yeah this is really nice. The implementation is much improved from a code/logic standpoint, and the results are right on. I haven't been able to find any cases yet where the results did not line up with my expectations! +1 +1

As far as I'm concerned this is ready to merge into v6.

I can think of a couple of cases, but I think the results are generally good enough not to have to handle them.

The code that checks if there's a word boundary could maybe include other white-space characters or all non-alphanumericals.
With the example "FiWeBr" matching "Firefox Web Browser" it actually matches "Firefox Web Browser" now, so the code will incorrectly give it a slight penalty for the last letter failing the word boundary condition. But this is how the Levenstein library matches it, and it's way more efficient than any code we could write. And if it's just one letter the penalty is so small it won't matter.

friday mentioned this pull request Oct 12, 2021

Improve fuzzy matching (again) #824

Closed

friday force-pushed the v6-fuzzy-rework branch 2 times, most recently from 1518008 to 7d7a807 Compare October 13, 2021 05:10

friday force-pushed the v6 branch from f9d04fd to de1cd32 Compare October 13, 2021 05:12

fuzzy matching rewrite using the new get_matching_blocks()

bfa8c05

friday force-pushed the v6-fuzzy-rework branch from 7d7a807 to bfa8c05 Compare October 13, 2021 05:18

troycurtisjr self-requested a review October 16, 2021 13:15

troycurtisjr approved these changes Oct 16, 2021

View reviewed changes

friday merged commit 6efc86a into v6 Oct 16, 2021

friday deleted the v6-fuzzy-rework branch October 16, 2021 15:47

friday mentioned this pull request Jan 23, 2022

Can I make the search "less fuzzy"? #949

Closed

3 tasks

friday mentioned this pull request Mar 29, 2022

Ulauncher v6 #869

Open

friday added this to the 6.0.0 milestone Apr 17, 2022

friday mentioned this pull request May 14, 2025

Getting focused to wrong match #1466

Closed

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fuzzy matching rewrite using new function get_matching_blocks() #827

fuzzy matching rewrite using new function get_matching_blocks() #827

Uh oh!

friday commented Oct 12, 2021

Uh oh!

troycurtisjr commented Oct 16, 2021

Uh oh!

friday commented Oct 16, 2021

Uh oh!

Uh oh!

fuzzy matching rewrite using new function get_matching_blocks() #827

fuzzy matching rewrite using new function get_matching_blocks() #827

Uh oh!

Conversation

friday commented Oct 12, 2021

Uh oh!

troycurtisjr commented Oct 16, 2021

Uh oh!

friday commented Oct 16, 2021

Uh oh!

Uh oh!