Closed Bug 670601 Opened 13 years ago Closed 9 years ago

[ro] Romanian language dependent font rendering and text search

Tracking

()

Status:

RESOLVED DUPLICATE of bug 374795

People

(Reporter: cristian.adam, Unassigned)

References

Details

(Whiteboard: [ro])

Attachments

(6 files)

bug670601_chrome28.png 11 years ago Cristian Adam 48.32 KB, image/png		Details
bug670601_chrome28_search.png 11 years ago Cristian Adam 26.65 KB, image/png		Details
bug670601_ff22.png 11 years ago Cristian Adam 151.81 KB, image/png		Details
bug670601_ff22_search.png 11 years ago Cristian Adam 29.30 KB, image/png		Details
bug670601_ie10.png 11 years ago Cristian Adam 42.34 KB, image/png		Details
bug670601_ie10_search.png 11 years ago Cristian Adam 22.28 KB, image/png		Details

Cristian Adam

Reporter

Description

•

13 years ago

User Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:5.0) Gecko/20100101 Firefox/5.0
Build ID: 20110615151330

Steps to reproduce:

Search after "Țț Șș" in the following page.

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN" "http://www.w3.org/TR/html4/strict.dtd"> 
<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8"/> 
<style type="text/css">
h1 {
	font-family: Georgia;
	font-size: 36px;
	font-weight: normal;	
	margin: 10px 0 20px 0;
}
</style>
</head>

<h1 lang="ro">&#354;&#355; &#350;&#351; (cedilla)</h1>
<h1 lang="ro">&#538;&#539; &#536;&#537; (comma) </h1>

</html>


Actual results:

Search results only in one match, even though I see the same characters twice, due to the new language dependent font rendering.



Expected results:

Firefox should have matched both text instances, which look identical for the user.

AndreiD[QA]

Comment 1

•

13 years ago

I can reproduce the issue on:
Mozilla/5.0 (Windows NT 6.1; WOW64; rv:8.0a1) Gecko/20110710 Firefox/8.0a1

Although, if I have the 2 Romanian keyboard inputs: Legacy and Standard, I get a search result with Legacy on for the "ţ" character on the first row and with the Standard input on for the same character on the second row.
I don't think it's actually a Firefox bug but a question for the web developer which Romanian standard to choose.
Cristian, please change this status to resolved if consider it this way. Thanks

Cristian Adam

Reporter

Comment 2

•

13 years ago

Firefox should do the same character promotion (cedilla -> comma) on search.

http://www.capisci.ro/ is affected by the locale dependent font rendering and I would expect searching for text that I see on the webpage to work.

I know that doesn't fall into strcmp/stricmp case. I think that Firefox should do more in this area.

Google Chrome has better support for collation, because they use ICU, thus it can find matches for any of "Țț Șș", "Ţţ Şş" or "Tt Ss" strings.

Cristian Adam

Reporter

Comment 3

•

13 years ago

I was talking about strstr instead of strcmp/stricmp. I should have mentioned directly nsString::Find (https://developer.mozilla.org/en/nsString#Find), which lacks any locale / collation support.

AndreiD[QA]

Comment 4

•

13 years ago

Confirming this on:
Mozilla/5.0 (Windows NT 6.1; WOW64; rv:8.0a1) Gecko/20110710 Firefox/8.0a1

Status: UNCONFIRMED → NEW

Ever confirmed: true

Mats Palmgren (inactive)

Comment 5

•

13 years ago

Duplicate of bug 202251?

Component: General → Find Backend

Product: Firefox → Core

QA Contact: general → find-backend

Cristian Adam

Reporter

Comment 6

•

13 years ago

Fixing bug 202251 would also fix this one. 

But in this case we're searching after a text that is actually displayed in the web page namely "Țț Șș" and not after "Tt Ss" which would be the case of the bug 202251.

Boris Zbarsky [:bzbarsky]

Updated

•

13 years ago

Depends on: 202251

Raul Nicolae Malea

Updated

•

13 years ago

Blocks: 632886

Raul Nicolae Malea

Updated

•

11 years ago

Whiteboard: [ro]

Raul Nicolae Malea

Comment 7

•

11 years ago

This bug is still present?

Summary: Romanian language dependent font rendering and text search → [ro] Romanian language dependent font rendering and text search

Cristian Adam

Reporter

Comment 8

•

11 years ago

The bug is still present in Mozilla Firefox 22.

I have tested on an English Windows 8 installation with Romanian locale configured the following browsers:

Browser name           | Same characters rendered | Both characters highlighted
--------------------------------------------------------------------------------
Mozilla Firefox 22     |          yes             |              no
Google Chrome 28       |           no             |              yes
Internet Explorer 10   |           no             |              yes

I've attached screen shots to confirm my findings.

Cristian Adam

Reporter

Comment 9

•

11 years ago

Attached image bug670601_chrome28.png — Details

Cristian Adam

Reporter

Comment 10

•

11 years ago

Attached image bug670601_chrome28_search.png — Details

Cristian Adam

Reporter

Comment 11

•

11 years ago

Attached image bug670601_ff22.png — Details

Cristian Adam

Reporter

Comment 12

•

11 years ago

Attached image bug670601_ff22_search.png — Details

Cristian Adam

Reporter

Comment 13

•

11 years ago

Attached image bug670601_ie10.png — Details

Cristian Adam

Reporter

Comment 14

•

11 years ago

Attached image bug670601_ie10_search.png — Details

Raul Nicolae Malea

Updated

•

11 years ago

Blocks: 907793

Raul Nicolae Malea

Comment 15

•

11 years ago

Any update for this?

Cristian Adam

Reporter

Comment 16

•

9 years ago

Bug 1128330 adds another case in which find is limited namely characters generated using combined diacritical marks.

This time there is no visual difference between s comma below (&#x0219;) and s + comma below (&#x0073;&#x0326;). The user will be for sure confused.

See https://www.assembla.com/code/cristianadam/subversion/node/blob/webpages/diacritice/test_diacritice_combinatorii.html?&rev=88 (souce code also at: http://pastebin.com/ir5QWTt1)

Google Chrome doesn't have this problem, it can find everything!

Cristian Adam

Reporter

Comment 17

•

9 years ago

(In reply to Cristian Adam from comment #16)
> 
> This time there is no visual difference between s comma below (&#x0219;) and
> s + comma below (&#x0073;&#x0326;). The user will be for sure confused.
> 

Forgot that this bug was about the inability of the user to visually distinguish between s and t comma below and s and t cedilla because of the "locl" promotion :)

I've uploaded the test code from the description here: https://www.assembla.com/code/cristianadam/subversion/node/blob/webpages/diacritice/test_diacritice_locl.html?rev=92

Now there are two cases when the user will not find what words that (s)he sees on the webpage.

Gingerbread Man

Comment 18

•

9 years ago

(In reply to Cristian Adam from comment #14)
> Created attachment 783943 [details]

I can't reproduce this with IE11 in Windows 7. It behaves the same as Firefox.

(In reply to Cristian Adam from comment #16)
> Google Chrome doesn't have this problem, it can find everything!

Bug 1147464. Same as Google Search, Chrome may very well find something other than what was intended. For example, it doesn't differentiate between S Ș Ş Š Ŝ Ś.

Status: NEW → RESOLVED

Closed: 9 years ago

Resolution: --- → DUPLICATE