We have added SSE2 optimization as bug 445552 and bug 456028, but only for MSVC++ x86 and GCC. I will add it for MSVC++ x64, which doesn't support inline assembler.
thanks for the initiative on these patches, Makoto. I think we should definitely have a path with intrinsics for x86_64 MSVC++ since it doesn't support inline assembler. The reason I didn't originally write this stuff with intrinsics is that vlad was worried that some of the code would end up less efficient that way. Once we verify that this code works for win64, I'd be curious to run some performance numbers on regular x86 of intrinsics vs inline assembler.
Patch looks pretty good. Before I r+ it though, I'd like to take a look at the results myself. I'm compiling the intrinsics version on my macbook. I'll post once I test it out.
arg, I built when the tree wasn't green. I'll get back to this tomorrow. Makoto - do you have any good way to test the performance of this patch?
This code path is for Windows x64 builds only (you need to get all the patches from my tree at http://hg.mozilla-x86-64.com/). Except for VC++ on x86_64, the previous code is used. If you want to use it on another platform, you need to change which compilers take this code path. gcc 4.x for Intel and VC++ 8.0 for x86 can both use these built-in SSE2 instructions such as _mm_*. I have tested on x86_64 Ubuntu/Fedora with gcc by modifying the code.
Comment on attachment 362537 [details] [diff] [review] patch v1.1 ok, looks good. r=bholley. Flagging vlad for sr. I'm running a modified version of this patch through the tryserver to see how the intrinsics fare against the raw assembly on mac/linux.
Note, lcms has been replaced with qcms recently.
qcms is bug 487900.
Marking WONTFIX, as lcms is no longer used and the equivalent improvement was added to qcms instead.
Status: ASSIGNED → RESOLVED
Closed: 10 years ago
Resolution: --- → WONTFIX
Comment on attachment 362537 [details] [diff] [review] patch v1.1 not using lcms anymore. canceling sr