Closed Bug 558245 Opened 14 years ago Closed 14 years ago

Modify HBase Thrift connection to use new row key format

Categories

(Socorro :: General, task)

task
Not set
normal

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: dre, Assigned: lars)

Details

As per the e-mail I sent to socorro-dev, I feel it would be best for us to change the HBase schema to store the OOID without prefixing the date.

It should be fairly straightforward to change in the code, but I'm happy to help with a review or patch if need be.
Well, in the latest version of hbaseClient.py I just happened to refactor that into a single function call rather than having it spread like buckshot through the code.  The change will be trivial.
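
For illustration only, here is a rough sketch of what keeping the row key logic in one function might look like. This is not the actual hbaseClient.py code: the function names and the exact layout of the old key (a date string placed directly in front of the OOID) are assumptions.

```python
# Hypothetical sketch; names and old-key layout are assumptions,
# not taken from the real hbaseClient.py.

def make_old_row_key(ooid, date_prefix):
    """Old format (assumed layout): a date string followed by the OOID."""
    return date_prefix + ooid


def make_row_key(ooid):
    """New format: the row key is just the OOID, with no date prefix."""
    return ooid
```

With every read and write going through one function like `make_row_key`, switching formats is a one-line change instead of an edit at every call site.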

Of course, we'll have to take all the existing data within HBase and "correct" it, ja?  That will have to be a coordinated action.

How soon do you want to make this change?
hehe.. buckshot.

My thoughts are to create a new table (but what better name than "crash_reports" could we possibly come up with?) and start inserting the new format into that.  Then we can easily have a MapReduce job that will take all the data from the old table and fix the row keys as it inserts into the new table.  Bonus: we can do the JSON fix at the same time.
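
A minimal sketch of the per-row key fix such a job would apply, written as plain Python rather than an actual MapReduce job. It assumes the old key is a fixed-length date prefix followed by the OOID; the prefix length of 6 and the function names are assumptions, and the JSON fix is left out.

```python
# Hypothetical sketch of the row key rewrite; not a real MapReduce job.
# Assumption: old keys are a fixed-length date prefix followed by the OOID.

def strip_date_prefix(old_key, prefix_length=6):
    """Return the OOID portion of an old-format row key."""
    if len(old_key) <= prefix_length:
        raise ValueError("row key too short to carry a date prefix: %r" % old_key)
    return old_key[prefix_length:]


def migrate_rows(old_rows, prefix_length=6):
    """Yield (new_key, columns) for each (old_key, columns) pair.

    Column data is passed through untouched; only the key changes.
    """
    for old_key, columns in old_rows:
        yield strip_date_prefix(old_key, prefix_length), columns
```

Each row is handled independently of the others, which is what makes this a natural fit for a map-only job that reads the old table and writes to the new one.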
The change described in the summary was done.
My thoughts in comment #2 are now expressed in other more specific bugs.
Status: NEW → RESOLVED
Closed: 14 years ago
Resolution: --- → FIXED
Component: Socorro → General
Product: Webtools → Socorro