Closed
Bug 1152462
Opened 10 years ago
Closed 10 years ago
Server-side versioning of pipeline data
Categories
(Cloud Services Graveyard :: Metrics: Pipeline, defect, P2)
Cloud Services Graveyard
Metrics: Pipeline
Tracking
(Not tracked)
RESOLVED
FIXED
People
(Reporter: mreid, Assigned: mreid)
Details
We need a mechanism for handling server-side data changes, including format/layout changes, locations, backprocessing, and migrating data.
I propose that we allocate a "meta" prefix within our production S3 bucket, and store mappings of the current location of each logical data source.
In other words, we have a "telemetry" data source, which could map to a bucket prefix of "/telemetry/v1", then if we need to modify the storage layout or data format, we could update to "/telemetry/v2".
Consumers of the data could consult the metadata to determine where the correct data is located (ideally this would be built in to client applications).
We could attach other metadata to data sources using this approach as well, such as the location of data specification documentation, the current storage layout, and so on.
Updated•10 years ago
|
Priority: -- → P2
Assignee | ||
Updated•10 years ago
|
Assignee: nobody → whd
Comment 1•10 years ago
|
||
https://github.com/mozilla-services/svcops/commit/3c3856b4ebbd8cd3cf7fa809775bf6baadb720f6
https://github.com/mozilla-services/svcops/commit/ff4fd5cc214c62888622e7268101c08c5651d65b
The production metadata bucket has been set up: s3://net-mozaws-prod-us-west-2-pipeline-metadata. It has RO access from new dev, and RW access via the production S3 credentials for the pipeline team. :mreid will populate the bucket with the requisite metadata.
We decided to use a separate bucket instead of an object prefix within the current bucket for flexibility.
Assignee | ||
Updated•10 years ago
|
Assignee: whd → mreid
Assignee | ||
Comment 2•10 years ago
|
||
The meta bucket is there, I will write up some docs asap.
Assignee | ||
Comment 3•10 years ago
|
||
I've added some documentation here:
https://wiki.mozilla.org/CloudServices/DataPipeline/Metadata
Status: NEW → RESOLVED
Closed: 10 years ago
Resolution: --- → FIXED
Updated•6 years ago
|
Product: Cloud Services → Cloud Services Graveyard
You need to log in
before you can comment on or make changes to this bug.
Description
•