Closed Bug 1271719 Opened 8 years ago Closed 8 years ago

Firefox Engagement Ratio Spark job is failing

Tracking

(Not tracked)

Status:

RESOLVED FIXED

People

(Reporter: mreid, Assigned: mreid)

Details

Mark Reid [:mreid]

Assignee

Description

•

8 years ago

Figure out why, fix it.

Mark Reid [:mreid]

Assignee

Comment 1

•

8 years ago

Looks somewhat obscure from the log:

...snip...
Caused by: java.io.FileNotFoundException: /mnt/yarn/usercache/hadoop/appcache/application_1462853298787_0001/blockmgr-f4358145-c989-417a-bcce-d0bfcc1a1371/09/shuffle_23_12_0.index (No such file or directory)
	at java.io.FileInputStream.open(Native Method)
	at java.io.FileInputStream.<init>(FileInputStream.java:146)
	at org.apache.spark.network.shuffle.ExternalShuffleBlockResolver.getSortBasedShuffleBlockData(ExternalShuffleBlockResolver.java:275)
	... 27 more

	at org.apache.spark.storage.ShuffleBlockFetcherIterator.throwFetchFailedException(ShuffleBlockFetcherIterator.scala:323)
	at org.apache.spark.storage.ShuffleBlockFetcherIterator.next(ShuffleBlockFetcherIterator.scala:300)
	at org.apache.spark.storage.ShuffleBlockFetcherIterator.next(ShuffleBlockFetcherIterator.scala:51)
	at scala.collection.Iterator$$anon$11.next(Iterator.scala:328)

Assignee: nobody → mreid

Points: --- → 1

Priority: -- → P1

Mark Reid [:mreid]

Assignee

Comment 2

•

8 years ago

There was a problem with the job that converts the Heka Executive Summary stream to Parquet, so the underlying derived dataset was missing some data.

I've changed the job to use the main_summary dataset here:
https://github.com/mozilla-services/data-pipeline/pull/205

The dashboard data has already been fixed and backfilled.

Mark Reid [:mreid]

Assignee

Updated

•

8 years ago

Status: NEW → RESOLVED

Closed: 8 years ago

Resolution: --- → FIXED

BMO Automation

Updated

•

6 years ago

Product: Cloud Services → Cloud Services Graveyard

You need to log in before you can comment on or make changes to this bug.

Bugzilla

Quick Search

Firefox Engagement Ratio Spark job is failing

Categories

(Cloud Services Graveyard :: Metrics: Pipeline, defect, P1)

Tracking

(Not tracked)

People

(Reporter: mreid, Assigned: mreid)

References

Details

Crash Data

Security

(public)

User Story

Description

Comment 1

Comment 2

Updated

Updated