384101 - text.getTextAtOffset broken for TEXT_BOUNDARY_LINE_START

Reporter

Description


17 years ago

In GNOME bugzilla bug http://bugzilla.gnome.org/show_bug.cgi?id=355525, I was tracking down why a certain feature of Orca wasn't working correctly. It turns out that the URL I was using as a test case (http://bugzilla.gnome.org/attachment.cgi?id=83911) contained a document whose text included embedded object and newline characters.

To help debug this in Orca, I added the following code to examine the text of the document frame. The code simply goes character by character through the text of the document frame, calling getTextAtOffset for each character position:

    if accessible.role == rolenames.ROLE_DOCUMENT_FRAME:
        for i in range(0, length):
            character = self.script.getText(accessible, i, i + 1)
            if character == self.script.EMBEDDED_OBJECT_CHARACTER:
                character = "EMBEDDED_OBJECT_CHARACTER"
            elif character == "\n":
                character = "\\n"
            print "%d. '%s'" % (i, character)
            [string, startOffset, endOffset] = text.getTextAtOffset(
                i, atspi.Accessibility.TEXT_BOUNDARY_LINE_START)
            print "  line(%d, %d) = '%s'" \
                  % (startOffset, endOffset, string)

For each character in the text of the document frame, the output gives the index of the character, the character itself, and what Gecko thinks the line is for that character, including the start and end offsets of the line.

Things seem to start failing around character 19, which is the 'T' that begins the line "This sentence is bold." Instead of failing as it did, I would expect getTextAtOffset with a boundary type of TEXT_BOUNDARY_LINE_START to return the entire line.

Here's the sample output:

    0. 'EMBEDDED_OBJECT_CHARACTER'
      line(0, 2) = ' '
    1. '\n'
      line(0, 2) = ' '
    2. '\n'
      line(2, 3) = ' '
    3. 'EMBEDDED_OBJECT_CHARACTER'
      line(3, 4) = ''
    4. 'EMBEDDED_OBJECT_CHARACTER'
      line(5, 18) = 'Text Formats '
    5. 'T'
      line(5, 18) = 'Text Formats '
    6. 'e'
      line(5, 18) = 'Text Formats '
    7. 'x'
      line(5, 18) = 'Text Formats '
    8. 't'
      line(5, 18) = 'Text Formats '
    9. ' '
      line(5, 18) = 'Text Formats '
    10. 'F'
      line(5, 18) = 'Text Formats '
    11. 'o'
      line(5, 18) = 'Text Formats '
    12. 'r'
      line(5, 18) = 'Text Formats '
    13. 'm'
      line(5, 18) = 'Text Formats '
    14. 'a'
      line(5, 18) = 'Text Formats '
    15. 't'
      line(5, 18) = 'Text Formats '
    16. 's'
      line(5, 18) = 'Text Formats '
    17. '\n'
      line(5, 18) = 'Text Formats '
    18. '\n'
      line(18, 19) = ' '
    19. 'T'
      line(18, 20) = ' T'
    20. 'h'
      line(18, 19) = ' '
    21. 'i'
      line(18, 19) = ' '
    22. 's'
      line(18, 19) = ' '
    23. ' '
      line(18, 19) = ' '
    24. 's'
      line(18, 19) = ' '
    25. 'e'
      line(18, 19) = ' '
    26. 'n'
      line(18, 19) = ' '
    27. 't'
      line(18, 19) = ' '
    28. 'e'
      line(18, 19) = ' '
    29. 'n'
      line(18, 19) = ' '
    30. 'c'
      line(18, 19) = ' '
    31. 'e'
      line(18, 19) = ' '
    32. ' '
      line(18, 19) = ' '
    33. 'i'
      line(18, 19) = ' '
    34. 's'
      line(18, 19) = ' '
    35. ' '
      line(18, 19) = ' '
    36. 'b'
      line(18, 19) = ' '
    37. 'o'
      line(18, 19) = ' '
    38. 'l'
      line(18, 19) = ' '
    39. 'd'
      line(18, 19) = ' '
    40. '.'
      line(18, 19) = ' '
    41. 'EMBEDDED_OBJECT_CHARACTER'
      line(41, 42) = ''
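
To make the expectation concrete, here is a rough sketch in standalone Python 3 (not Orca or Gecko code) of what I would expect a line-start query to return for a given offset. It rests on two simplifying assumptions: "\n" is the only line break (it ignores soft wrapping and any special handling of embedded object characters), and the returned range should always contain the queried offset. The names expected_line_at_offset and offsets_outside_reported_line are hypothetical helpers made up for illustration.

```python
def expected_line_at_offset(text, offset):
    """Return (string, start, end) for the line containing `offset`.

    Assumption: a line runs from just after the previous "\n" up to and
    including the next "\n" (or to the end of the text if there is none).
    """
    if not 0 <= offset < len(text):
        raise IndexError("offset out of range")
    # rfind returns -1 when no earlier newline exists, giving start == 0.
    start = text.rfind("\n", 0, offset) + 1
    newline = text.find("\n", offset)
    end = len(text) if newline == -1 else newline + 1
    return text[start:end], start, end


def offsets_outside_reported_line(results):
    """Given (offset, start, end) tuples, list the offsets that do not
    fall inside the half-open range [start, end) reported for them."""
    return [offset for offset, start, end in results
            if not start <= offset < end]


# Hypothetical text resembling part of the test document.
sample = "Text Formats\n\nThis sentence is bold."
print(expected_line_at_offset(sample, 14))

# Three (offset, start, end) results taken from the sample output above.
print(offsets_outside_reported_line([(4, 5, 18), (19, 18, 20), (20, 18, 19)]))
```

Under these assumptions, the query at the 'T' of "This sentence is bold." yields the whole sentence, and the containment check flags offsets 4 and 20 from the sample output. Offset 19 passes the containment check even though the returned string ' T' is clearly not the full line, so the check is only a coarse sanity test.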

Willie Walker

Reporter

Updated


17 years ago

Assignee: nobody → aaronleventhal

Component: Disability Access → Disability Access APIs

Product: Firefox → Core

QA Contact: disability.access → accessibility-apis

Scott Haeger

Comment 1


17 years ago

An oddity can be seen using Accerciser on the test page (2nd link in opening comment). The accessible at 0 4 8 0 0 2 is a ghost accessible (not a link) with no role or text. In addition, the second and third lines of text are not shown in the accessible tree.

Scott Haeger

Comment 2


17 years ago

After examining the markup, I suspect the nasty bug is to blame.