If you think a bug might affect users in the 57 release, please set the correct tracking and status flags for Release Management.

Histograms class needs to catch large ints

NEW
Unassigned

Status

Data Platform and Tools
Telemetry APIs for Analysis
2 months ago
2 months ago

People

(Reporter: frank, Unassigned)

Tracking

Details

(Reporter)

Description

2 months ago
The android-clients job failed with the following error:

```
    Caused by: org.apache.spark.SparkException: Job aborted due to stage failure: Task 93 in stage 10.0 failed 4 times, most recent failure: Lost task 93.3 in stage 10.0 (TID 3513, ip-172-31-7-115.us-west-2.compute.internal): org.apache.spark.api.python.PythonException: Traceback (most recent call last):
      File "/mnt1/yarn/usercache/hadoop/appcache/application_1501690791625_0001/container_1501690791625_0001_01_000002/pyspark.zip/pyspark/worker.py", line 172, in main
        process()
      File "/mnt1/yarn/usercache/hadoop/appcache/application_1501690791625_0001/container_1501690791625_0001_01_000002/pyspark.zip/pyspark/worker.py", line 167, in process
        serializer.dump_stream(func(split_index, iterator), outfile)
      File "/usr/lib/spark/python/pyspark/rdd.py", line 2371, in pipeline_func
      File "/usr/lib/spark/python/pyspark/rdd.py", line 2371, in pipeline_func
      File "/usr/lib/spark/python/pyspark/rdd.py", line 317, in func
      File "/usr/lib/spark/python/pyspark/rdd.py", line 1792, in combineLocally
      File "/mnt1/yarn/usercache/hadoop/appcache/application_1501690791625_0001/container_1501690791625_0001_01_000002/pyspark.zip/pyspark/shuffle.py", line 236, in mergeValues
        for k, v in iterator:
      File "/mnt/anaconda2/lib/python2.7/site-packages/moztelemetry/spark.py", line 150, in <lambda>
        additional_histograms)) \
      File "/mnt/anaconda2/lib/python2.7/site-packages/moztelemetry/spark.py", line 232, in _get_ping_properties
        additional_histograms)
      File "/mnt/anaconda2/lib/python2.7/site-packages/moztelemetry/spark.py", line 283, in _get_merged_histograms
        additional_histograms)
      File "/mnt/anaconda2/lib/python2.7/site-packages/moztelemetry/spark.py", line 264, in _get_ping_property
        additional_histograms=additional_histograms)
      File "/mnt/anaconda2/lib/python2.7/site-packages/moztelemetry/histogram.py", line 163, in __init__
        entries = {int(k): v for k, v in instance["values"].items()}
      File "/mnt/anaconda2/lib/python2.7/site-packages/moztelemetry/histogram.py", line 163, in <dictcomp>
        entries = {int(k): v for k, v in instance["values"].items()}
    ValueError: invalid literal for int() with base 10: '3.0683611344442837e+257'
```

Which should be caught and discarded or converted in Python.
You need to log in before you can comment on or make changes to this bug.