Closed Bug 910385 Opened 11 years ago Closed 8 years ago

ftp scraper not capturing b2g builds anymore

Categories

(Socorro :: Webapp, task)

x86
All
task
Not set
normal

Tracking

(Not tracked)

VERIFIED WONTFIX

People

(Reporter: lonnen, Unassigned)

Details

Attachments

(1 file)

For the last few weeks (how long?) the ftp scraper has been failing to capture b2g builds. Something changed on releng's side and we need to update to ftp scraper to get builds again.
Assignee: nobody → rhelmer
OS: Mac OS X → All
AFAIK, the internal for-device builds are not on public FTP and the end-user builds are not built by us and therefor in no place accessible to Mozilla, usually, both of which makes things harder for us here. We should be able to scrape for the desktop builds though, so something is probably going wrong there.
(In reply to Robert Kaiser (:kairo@mozilla.com) from comment #1) > AFAIK, the internal for-device builds are not on public FTP and the end-user > builds are not built by us and therefor in no place accessible to Mozilla, > usually, both of which makes things harder for us here. > > We should be able to scrape for the desktop builds though, so something is > probably going wrong there. Yep, FTP directory changed slightly, here's a PR: https://github.com/mozilla/socorro/pull/1451
Target Milestone: --- → 57
No longer blocks: 910372
Commits pushed to master at https://github.com/mozilla/socorro https://github.com/mozilla/socorro/commit/12a59a3764e4dd70d9d35d71e3e5cfa21eabdf03 fixes bug 910385 - manifests have moved to nightly dir https://github.com/mozilla/socorro/commit/1e9aaf47fd8f35a617e08dbb4edec6db5582050d Merge pull request #1451 from rhelmer/bug910385-b2g-ftpscraper-broken fixes bug 910385 - manifests have moved to nightly dir
Status: NEW → RESOLVED
Closed: 11 years ago
Resolution: --- → FIXED
Attached image qa - reopened
Reopening for a bit of investigation - /builds/ on both stage and prod appear to be empty. Are we scraping/inserting into the db/presenting the data in the webapp correct? https://crash-stats.mozilla.com/builds/products/B2G https://crash-stats.allizom.org/builds/products/B2G
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Did some preliminary investigation - looks like we are capturing the metadata into releases_raw table, so the basic premise of this bug is fixed. I think it's probably falling down in the normalization process somewhere, here's what I've found so far: There are several "repository" entries which are not present in the "release_repositories" table - we surely do not want experimental branches, but we should investigate the ones like "mozilla-b2g18_v1_0_1" and "mozilla-b2g18_v1_0_0": breakpad=> select distinct repository from releases_raw where product_name = 'b2g'; repository ---------------------- oak mozilla-b2g18 mozilla-central mozilla-b2g18_v1_0_1 ionmonkey fx-team mozilla-aurora date mozilla-beta mozilla-b2g18_v1_0_0 b2g-release (11 rows) Also I notice that the "build_type" column in releases_raw is "nightly" rather than "Nightly" for the latest b2g builds, not sure if this matters.. I'd suggest we match case-insensitively here (if we aren't already (from a quick grep through the stored procedures, I think this isn't the problem, just something I wanted to mention Just In Case) Selena, would you mind looking at this? I suspect that we just need to add a few repositories to release_repositories and run backfill.
Flags: needinfo?(sdeckelmann)
Assignee: rhelmer → sdeckelmann
Flags: needinfo?(sdeckelmann)
Target Milestone: 57 → ---
Here are the repos that are reported in releases_raw but not in release_respositories: breakpad=# select distinct repository from releases_raw where product_name = 'b2g' AND repository NOT IN ( select repository from release_repositories); repository ---------------------- oak mozilla-b2g18 mozilla-b2g18_v1_0_1 ionmonkey fx-team date mozilla-b2g18_v1_0_0 b2g (8 rows) I'm attempting a backfill on stage to see what happens...
I think the relevant code is: https://github.com/mozilla/socorro/blob/master/socorro/external/postgresql/products_builds.py#L45 Maybe tomorrow someone can help me set up local middleware so I can get it to spit out the SQL. I think there's a problem with the date logic.
(In reply to Selena Deckelmann :selenamarie :selena from comment #7) > I think the relevant code is: > https://github.com/mozilla/socorro/blob/master/socorro/external/postgresql/ > products_builds.py#L45 > > Maybe tomorrow someone can help me set up local middleware so I can get it > to spit out the SQL. I think there's a problem with the date logic. Using local mware is documented on mana now - I know both peterbe and I have it going ok (I use it quite a bit) so I am sure one of us can help. The easiest way to get it to spit out SQL is to uncomment: https://github.com/mozilla/socorro/blob/master/socorro/external/postgresql/base.py#L86 I noticed there's a typo on that line though and have neglected to file a bug, it should be: self.context.logger.debug(connection.cursor().mogrify(sql, params)) ( n.b. cursor() not cursor )
Is this still an issue?
Assignee: sdeckelmann → nobody
no longer a concern
Status: REOPENED → RESOLVED
Closed: 11 years ago8 years ago
Resolution: --- → WONTFIX
Bumping to verified wontfix.
Status: RESOLVED → VERIFIED
You need to log in before you can comment on or make changes to this bug.

Attachment

General

Created:
Updated:
Size: