Open Bug 1732530 Opened 9 months ago Updated 9 months ago

In "Find in page" (Ctrl-F), U+2212 MINUS SIGN also matches the ASCII U+002D HYPHEN-MINUS

Categories

(Toolkit :: Find Toolbar, defect)

Firefox 88
defect

Tracking Status
firefox92 --- affected
firefox93 --- affected
firefox94 --- affected

People

(Reporter: vincent-moz, Unassigned)

Details

User Agent: Mozilla/5.0 (X11; Linux x86_64; rv:88.0) Gecko/20100101 Firefox/88.0

Steps to reproduce:

  1. Open an HTML page containing U+2212 MINUS SIGN (−) and ASCII U+002D HYPHEN-MINUS (-) characters (this bug page should be such an example).
  2. Type [Ctrl-F] [−] to find U+2212 MINUS SIGN characters in the page.

Actual results:

ASCII U+002D HYPHEN-MINUS (-) characters are also matched.

Expected results:

Only U+2212 MINUS SIGN (−) characters should match.

This is a regression. Old Firefox versions did not have this issue.

Conversely, searching for U+002D HYPHEN-MINUS (-) also incorrectly finds U+2212 MINUS SIGN (−) characters.

I've just noticed that this issue disappears if "Match Diacritics" is selected. However, diacritics are about accented letters vs. base letters, as described in the tooltip, not about U+2212 MINUS SIGN vs. U+002D HYPHEN-MINUS.
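
To illustrate the point about diacritics: a minimal Python sketch, assuming that "ignoring diacritics" means canonical decomposition (NFD) followed by removal of combining marks. This is not Firefox's actual folding code, which is not shown in this report; `strip_diacritics` is a hypothetical helper. Under that assumption an accented letter folds to its base letter, while U+2212 has no Unicode decomposition at all and is left untouched.

```python
import unicodedata

MINUS_SIGN = "\u2212"    # − MINUS SIGN
HYPHEN_MINUS = "\u002d"  # - HYPHEN-MINUS


def strip_diacritics(text: str) -> str:
    """Hypothetical helper: decompose to NFD and drop nonspacing marks (category Mn)."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if unicodedata.category(ch) != "Mn")


# An accented letter folds to its base letter.
print(strip_diacritics("\u00e9") == "e")                # True

# U+2212 has no decomposition, so diacritic stripping cannot turn it into U+002D.
print(unicodedata.decomposition(MINUS_SIGN) == "")      # True
print(strip_diacritics(MINUS_SIGN) == HYPHEN_MINUS)     # False
```

So any equivalence between the two characters has to come from an extra folding rule beyond diacritic removal.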

Hi Vincent,

Thank you for your report. I tested this on Windows 10 Firefox Nightly 94.0a1 (2021-10-01) (64-bit), Beta 93.0, Release 90.0.2 (64-bit) and 92.0.1 (64-bit).
I'll mark this issue as NEW and add it to the corresponding team for visibility.

In the meantime, since you mentioned this is a regression, would you share the Firefox version where you saw that this issue was not happening? This information will be useful to the investigation.

Thanks in advance,
Virginia

Severity: -- → S3
Status: UNCONFIRMED → NEW
Has Regression Range: --- → no
Has STR: --- → yes
Component: Untriaged → Find Toolbar
Ever confirmed: true
Flags: needinfo?(vincent-moz)
Product: Firefox → Toolkit

(In reply to Virginia Balducci from comment #3)

> In the meantime, since you mentioned this is a regression, would you share the Firefox version where you saw that this issue was not happening?

Firefox 52.9.0.

Flags: needinfo?(vincent-moz)

Related to that, "Match Diacritics" also has an effect on whether U+0027 APOSTROPHE (') matches U+2019 RIGHT SINGLE QUOTATION MARK (’) or not, at least on French pages. In French, an apostrophe is not a diacritic. In particular, one may want to match diacritics (to avoid false positives), but still have the U+0027 / U+2019 equivalence due to inconsistencies on web pages in practice.
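
A rough Python sketch of what decoupling the two behaviours could look like, under stated assumptions: `QUOTE_FOLD`, `fold`, `find` and the `fold_quotes` flag are hypothetical names for illustration and do not correspond to any existing Firefox option or API. The quote table contains only the U+2019 / U+0027 pair discussed here.

```python
import unicodedata

# Hypothetical equivalence table: only the pair discussed in this comment.
QUOTE_FOLD = {"\u2019": "'"}  # RIGHT SINGLE QUOTATION MARK -> APOSTROPHE


def fold(text: str, match_diacritics: bool, fold_quotes: bool) -> str:
    """Normalize text for comparison according to two independent options."""
    if fold_quotes:
        text = "".join(QUOTE_FOLD.get(ch, ch) for ch in text)
    if not match_diacritics:
        # Ignore diacritics: decompose, then drop nonspacing marks.
        decomposed = unicodedata.normalize("NFD", text)
        text = "".join(ch for ch in decomposed if unicodedata.category(ch) != "Mn")
    return text


def find(needle: str, haystack: str, *, match_diacritics: bool = False,
         fold_quotes: bool = True) -> bool:
    """Substring search after folding both sides the same way."""
    return fold(needle, match_diacritics, fold_quotes) in fold(
        haystack, match_diacritics, fold_quotes)


# Diacritics are matched strictly, yet the typewriter apostrophe still finds U+2019.
print(find("l'eau", "l\u2019eau", match_diacritics=True))      # True
print(find("cote", "c\u00f4t\u00e9", match_diacritics=True))   # False
print(find("cote", "c\u00f4t\u00e9"))                          # True
```

With the two options independent, enabling strict diacritic matching no longer disables the apostrophe equivalence.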
