1291249 - Logs uploaded by generic-worker to public-artifacts.taskcluster.net aren't using gzip

Reporter

Description

•

8 years ago

Compare:

1) curl -IL "https://queue.taskcluster.net/v1/task/Kf6jXsCGRGqu9N-6QLmhdA/runs/0/artifacts/public%2Flogs%2Flive_backing.log"
2) curl -IL "https://queue.taskcluster.net/v1/task/BNw5W2zzQdifz6iQS0Hc8g/runs/0/artifacts/public%2Flogs%2Flive_backing.log"

For both, the initial redirect (presumably served by cloud-mirror) is identical (and points to public-artifacts.taskcluster.net).

However the response from public-artifacts.taskcluster.net differs between the two:

For (1):
 - Content-Type: text/plain; charset=utf-8
 - Content-Encoding: gzip
 - Server: AmazonS3
 - X-Cache: Miss from cloudfront
 - Via: 1.1 0e74db3db7a48d7f6e7f64382ca5c629.cloudfront.net (CloudFront)
 - x-amz-version-id: TE5s3plkjKgBZb2FRdukrukeN940HqDg
 - X-Amz-Cf-Id: nAnd-_UToIJvatX0HRIPa6-kUru-aCE8RXwjwsBG5S9nQR7LlUboNw==

For (2):
 - Content-Type: text/plain
 - (No `Content-Encoding` header)
 - Server: AmazonS3
 - X-Cache: Miss from cloudfront
 - Via: 1.1 4222b2a73c8078ae05f5cfa25b5cd0ab.cloudfront.net (CloudFront)
 - x-amz-version-id: clGXtTUi3qcrhVzN8H2RzKxyCyz0wI3P
 - X-Amz-Cf-Id: E0QD4YOLEaD66JBz9KHPOzpUceR4xALaSlxZAN8VmxP6rk_0lmzTRg==

ie for the 2nd, the response isn't gzipped.

Are these coming from separate S3 buckets, where one has gzip enabled, and others not?

The problem is that this causes transfers to take more time, plus currently breaks the Treeherder log parser, since it assumes the response will be gzipped (something we could fix, though it would mean we wouldn't spot gzip regressions, so I'm on the fence as to whether we should do so).

Many thanks :-)

Flags: needinfo?(jhford)

Flags: needinfo?(garndt)

Greg Arndt [:garndt]

Comment 1

•

8 years ago

It appears the differences here are that #1 is a docker-worker task that gzips the logs and #2 is a generic-worker task where it does not do the same.

Pete, is there any reason we do not gzip the logs that generic worker uploads?

Flags: needinfo?(garndt) → needinfo?(pmoore)

Ed Morley [:emorley]

Reporter

Comment 2

•

8 years ago

Ah thank you (I didn't even know there were two different types of workers).

Component: Integration → Generic-Worker

Summary: Some logs served from public-artifacts.taskcluster.net aren't using gzip → Logs uploaded by generic-worker public-artifacts.taskcluster.net aren't using gzip

Ed Morley [:emorley]

Reporter

Updated

•

8 years ago

Summary: Logs uploaded by generic-worker public-artifacts.taskcluster.net aren't using gzip → Logs uploaded by generic-worker to public-artifacts.taskcluster.net aren't using gzip

Ed Morley [:emorley]

Reporter

Updated

•

8 years ago

Github Pull Request for generic-worker 8 years ago Pete Moore [:pmoore][:pete] 53 bytes, text/x-github-pull-request	garndt : review+	Details \| Review
Github Pull Request for OpenCloudConfig 8 years ago Pete Moore [:pmoore][:pete] 56 bytes, text/x-github-pull-request	grenade : review+	Details \| Review