Open Bug 1102450 (fix-readability) Opened 7 years ago Updated 3 years ago

[meta] Readability algorithm improvements

Categories

(Toolkit :: Reader Mode, defect)

All
Android
defect
Not set
normal

Tracking

()

People

(Reporter: Margaret, Unassigned)

References

(Depends on 13 open bugs)

Details

(Keywords: meta)

With bug 786638, we'll be able to easily add new testcases to the tree to test different Readability features. I'm taking this opportunity to collect some bugs that we could fix to make our client-side article parsing better.
Depends on: 760554
Depends on: 997134
Depends on: 997504
Duplicate of this bug: 1028391
Depends on: 800305
Depends on: 847844
Depends on: 1107097
Depends on: 785549
Component: Readability → Reader Mode
Product: Firefox for Android → Toolkit
Version: Firefox 35 → Trunk
Depends on: 1124275
Depends on: 784653
Depends on: 1125711
Depends on: 1127778
Depends on: 1127795
Depends on: 1128916
No longer depends on: 1128916
FYI, development for these issues should happen in this shared library on github:
https://github.com/mozilla/readability
Depends on: 1131393
Depends on: 1131464
Depends on: 1134810
Depends on: 1134818
Depends on: 1134965
Depends on: 1046112
No longer depends on: 1107097
Depends on: 1137258
Depends on: 794958
Depends on: 780664
Depends on: 787260
Depends on: 794480
Depends on: 795906
Depends on: 800165
Depends on: 809724
Depends on: 792366
Depends on: 1128916
Depends on: 1139165
Alias: fix-readability
Keywords: meta
Summary: [meta] Readability algorithm improvements → Readability algorithm improvements
Depends on: 1142312
Depends on: 1112911
Depends on: 1144407
Depends on: 1144441
Depends on: 1144355
No longer depends on: 1128916
Depends on: 1161123
Depends on: 1166687
Depends on: 1167568
Depends on: 1167569
Depends on: 1167573
Depends on: 1168101
Depends on: 1171894
Depends on: 1176851
Depends on: 1177360
Depends on: 1179222
Depends on: 1182260
if there is a table in the article it would be nice to replace the <tr> with a <br> and the <td> and <th> with a dash - (It currently makes one long string from my tables)

Have

foo - bar - foobar
13 - 12 - 13
4 - 55 - 66

rahter than

foobarfoobar13221345566
Depends on: 1230050
Depends on: 1240035
Depends on: 1261540
Depends on: 1324630
In the test cases, for Wikipedia page, in metadata, I think the title should be Mozilla" instead of "Mozilla - Wikipedia" so it would use the actual article title and not the string in the title tag, so if my assumption is right can someone confirm and I can try improve the _getArticleTitle function
Depends on: 1532277
Summary: Readability algorithm improvements → [meta] Readability algorithm improvements
You need to log in before you can comment on or make changes to this bug.