Open Bug 1646502 Opened 5 years ago Updated 1 month ago

[meta] Simplify log parsing - switch to artifacts for Perfherder data and failure lines

Tracking

(Not tracked)

Status:

NEW

People

(Reporter: sclements, Unassigned)

References

(Blocks 1 open bug)

Details

(Whiteboard: [fxp])

Sarah Clements [:sclements]

Reporter

Description

•

5 years ago

•

Edited

Per a few recent discussions, log parsing could be greatly simplified and made more resilient if we weren't parsing live_backing logs for PERFHERDER_DATA log lines and failure lines for every task.

A few steps to take:

Switch to consuming Perfherder artifacts (json would be ideal)
Switch to only consuming error_summary.json files (standardized naming would be great, per bug 1629716)*. This will also help with data Push Health provides since it relies on data from these logs. Currently we're doing both (if an error_summary file exists) with the TextLogError/Step lines used for the log viewer and the bug suggestions API.

In order to implement #2, we'd need to investigate whether we can get non-browser crash data some other way than from the live_backing log.

Sarah Clements [:sclements]

Reporter

Comment 1

•

5 years ago

We also might be able to reduce table size further in Treeherder since failure lines from the live_backing log are stored in the TextLogError table and data from the error summary logs are stored in the FailureLines table.

Nobody; OK to take it and work on it

Assignee

Updated

•

4 years ago

Component: Treeherder: Log Parsing & Classification → TreeHerder

Greg Mierzwinski [:sparky]

Updated

•

1 year ago

Whiteboard: [fxp]

Jira Integration Bot

Updated

•

1 year ago

See Also: → https://mozilla-hub.atlassian.net/browse/FXP-3494

Greg Mierzwinski [:sparky]

Updated

•

1 year ago

Depends on: 1892260

Andrew Halberstadt [:ahal]

Comment 2

•

1 year ago

Big +1 to consuming a specialized artifact over parsing logs.

But while we're changing the ingestion, I would also like to request that the artifact support a custom path, and possibly even multiple paths.

We have a new type of Taskcluster task that farms out builds to Bitrise. In order to cut costs (because all these tasks do is poll the Bitrise API in a loop), we've added the ability for them to trigger multiple Bitrise workflows at once. But this means that we need to namespace the artifacts from those workflows, e.g instead of public/live.log, it is public/<bitrise workflow name>/live.log.

So when designing this new format, I'd like to request some way consumers can specify where the artifact(s) live (even if it's just hardcoded into a config in Treeherder, that's probably fine).

Greg Mierzwinski [:sparky]

Updated

•

1 year ago

Blocks: 1831943

Myeongjun Go

Updated

•

3 months ago

Blocks: 1990742

Greg Mierzwinski [:sparky]

Updated

•

3 months ago

No longer blocks: 1990742

Depends on: 1990742

Myeongjun Go

Updated

•

2 months ago

Blocks: 1995932

Myeongjun Go

Updated

•

2 months ago

Blocks: 1998197

Myeongjun Go

Updated

•

2 months ago

Blocks: 1999975

Myeongjun Go

Updated

•

2 months ago

No longer blocks: 1999975

Greg Mierzwinski [:sparky]

Updated

•

1 month ago

Depends on: 1986824

Greg Mierzwinski [:sparky]

Updated

•

1 month ago

No longer blocks: 1998197

Depends on: 1998197

Greg Mierzwinski [:sparky]

Updated

•

1 month ago

No longer blocks: 1995932

Depends on: 1995932

You need to log in before you can comment on or make changes to this bug.

Bugzilla

[meta] Simplify log parsing - switch to artifacts for Perfherder data and failure lines

Categories

(Tree Management :: Treeherder, enhancement)

Tracking

(Not tracked)

People

(Reporter: sclements, Unassigned)

References

(Blocks 1 open bug)

Details

(Whiteboard: [fxp])

Crash Data

Security

(public)

User Story

Description

Comment 1

Updated

Updated

Updated

Updated

Comment 2

Updated

Updated

Updated

Updated

Updated

Updated

Updated

Updated

Updated

Updated