Closed Bug 1845006 Opened 1 year ago Closed 1 year ago

Fix a few intertwined bugs with data URL parsing (double-parsing of the mime type, not fully serializing them, and doubling of charsets in content-type headers)

Tracking

()

Status:

RESOLVED FIXED

Milestone:

118 Branch

Tracking Flags:

Tracking

Status

firefox118

---

fixed

People

(Reporter: twisniewski, Assigned: twisniewski)

References

Details

(Whiteboard: [necko-triaged])

Attachments

(1 file)

Bug 1845006 - store the fully-serialized MimeType on data url channels so XHR and fetch may use it for content-type response headers, and clean up the data url parsing code to better match the spec. r?kershaw,sunil 1 year ago Thomas Wisniewski [:twisniewski] 48 bytes, text/x-phabricator-request		Details \| Review

Thomas Wisniewski [:twisniewski]

Assignee

Description

•

1 year ago

•

Edited

Basically, we already parse the mime type part of the URL here: https://searchfox.org/mozilla-central/source/netwerk/protocol/data/nsDataChannel.cpp#61
But SetContentType and SetContentCharset at the end of the function will also re-parse the result, using our older parser which isn't saving the parameters of the mime type: https://searchfox.org/mozilla-central/source/netwerk/protocol/data/nsDataChannel.cpp#102-103
We should just set mContentType and mContentCharset in the first line above.

Per spec we should also be serializing the full resulting mimetype here, not just getting its essence: https://searchfox.org/mozilla-central/source/netwerk/protocol/data/nsDataHandler.cpp#189

Finally, the fetch and XHR code doubles-up the charset needlessly in these spots, so we should tell it to only add the charset if it's not already present:

Thomas Wisniewski [:twisniewski]

Assignee

Comment 1

•

1 year ago

•

Edited

https://treeherder.mozilla.org/jobs?repo=try&revision=536704b4eb927bf7ca6d1b9f5e5186339fde57b9

Thomas Wisniewski [:twisniewski]

Assignee

Updated

•

1 year ago

Summary: Use CMimeType::Parse in nsBaseChannel::setContentType, instead of net_ParseContentType → Fix a few bugs with data URL parsing (double-parsing of the mime type, not fully serializing them, and doubling of charsets in content-type headers)

Thomas Wisniewski [:twisniewski]

Assignee

Updated

•

1 year ago

Summary: Fix a few bugs with data URL parsing (double-parsing of the mime type, not fully serializing them, and doubling of charsets in content-type headers) → Fix a few intertwined bugs with data URL parsing (double-parsing of the mime type, not fully serializing them, and doubling of charsets in content-type headers)

Sunil Mayya

Updated

•

1 year ago

Severity: -- → S3

Priority: -- → P2

Whiteboard: [necko-triaged]

Thomas Wisniewski [:twisniewski]

Assignee

Comment 2

•

1 year ago

It turns out that too much Gecko code relies on whatever "content type" happens to mean right now, which is not "whatever is specified by the data url as per the spec", but seems to be more along the lines of the essence of the MimeType (more or less). As such it's too risky to change all of that, so I've taken a new approach:

update our data url parsing code to follow the spec text more closely to pass these WPTs.
have it put the full MimeType in a new instance var on the data url channel
have XHR/fetch use that new value for content-type response headers on data urls.

New try run here: https://treeherder.mozilla.org/jobs?repo=try&revision=657429472db615aa68036814b8e8ead2a51409f5

Thomas Wisniewski [:twisniewski]

Assignee

Comment 3

•

1 year ago

Attached file Bug 1845006 - store the fully-serialized MimeType on data url channels so XHR and fetch may use it for content-type response headers, and clean up the data url parsing code to better match the spec. r?kershaw,sunil — Details

Phabricator Automation

Updated

•

1 year ago

Assignee: nobody → twisniewski

Status: NEW → ASSIGNED

Pulsebot

Comment 4

•

1 year ago

Pushed by twisniewski@mozilla.com: https://hg.mozilla.org/integration/autoland/rev/8d98996f824c store the fully-serialized MimeType on data url channels so XHR and fetch may use it for content-type response headers, and clean up the data url parsing code to better match the spec. r=kershaw,sunil,necko-reviewers

Cristina Horotan [:chorotan]

Comment 5

•

1 year ago

Backed out changeset 8d98996f824c (Bug 1845006) for causing build bustage at FetchDriver.cpp

Backout: https://hg.mozilla.org/integration/autoland/rev/23140ddbeb8d9d3b91d2e3111b0e640f249e1c02

Failure push: https://treeherder.mozilla.org/jobs?repo=autoland&group_state=expanded&resultStatus=testfailed%2Cbusted%2Cexception%2Cretry%2Cusercancel&revision=8d98996f824c284a7c7ad1d4f2fa8872868139e9

Failure log: https://treeherder.mozilla.org/logviewer?job_id=424652254&repo=autoland&lineNumber=9467

Flags: needinfo?(twisniewski)

Thomas Wisniewski [:twisniewski]

Assignee

Comment 6

•

1 year ago

Ah, apologies. This is a quick fix I should have caught by rebuilding first after rebasing, then landing. Fix incoming later today.

Flags: needinfo?(twisniewski)

Pulsebot

Comment 7

•

1 year ago

Pushed by twisniewski@mozilla.com: https://hg.mozilla.org/integration/autoland/rev/f96e24bbd71c store the fully-serialized MimeType on data url channels so XHR and fetch may use it for content-type response headers, and clean up the data url parsing code to better match the spec. r=kershaw,sunil,necko-reviewers

Narcis Beleuzu [:NarcisB]

Comment 8

•

1 year ago

Backed out for bustages on nsDataHandler.cpp

Backout link: https://hg.mozilla.org/integration/autoland/rev/d490ff4c8be402a348becc31bc38bc4bf45193f0
Log link: https://treeherder.mozilla.org/logviewer?job_id=424688770&repo=autoland&lineNumber=31345

Flags: needinfo?(twisniewski)

Pulsebot

Comment 9

•

1 year ago

Pushed by twisniewski@mozilla.com: https://hg.mozilla.org/integration/autoland/rev/0d9bc1bc0799 store the fully-serialized MimeType on data url channels so XHR and fetch may use it for content-type response headers, and clean up the data url parsing code to better match the spec. r=kershaw,sunil,necko-reviewers

Serban Stanca [:SerbanS]

Comment 10

•

1 year ago

bugherder

https://hg.mozilla.org/mozilla-central/rev/0d9bc1bc0799

Status: ASSIGNED → RESOLVED

Closed: 1 year ago

status-firefox118: --- → fixed

Resolution: --- → FIXED

Target Milestone: --- → 118 Branch

Thomas Wisniewski [:twisniewski]

Assignee

Updated

•

1 year ago

Flags: needinfo?(twisniewski)

You need to log in before you can comment on or make changes to this bug.

Bugzilla

Fix a few intertwined bugs with data URL parsing (double-parsing of the mime type, not fully serializing them, and doubling of charsets in content-type headers)

Categories

(Core :: Networking, defect, P2)

Tracking

()

People

(Reporter: twisniewski, Assigned: twisniewski)

References

Details

(Whiteboard: [necko-triaged])

Crash Data

Security

(public)

User Story

Attachments

(1 file)

Description

Comment 1

Updated

Updated

Updated

Comment 2

Comment 3

Updated

Comment 4

Comment 5

Comment 6

Comment 7

Comment 8

Comment 9

Comment 10

Updated

Attachment

General

Description

File Name

Content Type