Closed Bug 993758 Opened 12 years ago Closed 11 years ago

Access to public mailman list archives for Michael Trinkala

Categories

(Infrastructure & Operations :: Infrastructure: Mail, task)

x86
macOS
task
Not set
normal

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: csorenson, Unassigned)

Details

From SN RITM0025373 - One of my team is interested in using our corpus of public email lists to generate a searchable database. Is there an API he could access to slurp down this data, or some other way? Thanks!
Note that a usenet or google groups slurper would be able to pull them down easily, since they're all mirrored to usenet/etc (for the most part).
Adding trink to the bug, since he's the one who would actually be accessing the data.
Summary: Access to public mailman list archives for Tobey Elliott → Access to public mailman list archives for Michael Trinkala
Anything actionable here? Only way this is possible is to use something like the archives in usenet per comment 1. Other than that there is no API on the mailman part of things
Reading directly from the file system would be preferred.
(In reply to Mike Trinkala [:trink] from comment #4) > Reading directly from the file system would be preferred. For security reasons we cannot provide you with access to the server to actually read these files if they are indeed archived. Best course of action here for you is to read from NNTP. The list of our lists that are synced via NNTP is listed here https://lists.mozilla.org/newsgroups.txt
If it's of any help, https://docs.python.org/3/library/nntplib.html has a brief, although useful, example of how to scrape a newsgroup archive of a mailing list using Python.
Seems like we did the best we could here. R/Fing for now, feel free to reopen if necessary.
Status: NEW → RESOLVED
Closed: 11 years ago
Resolution: --- → FIXED
You need to log in before you can comment on or make changes to this bug.