Closed Bug 625501 Opened 14 years ago Closed 14 years ago

Socorro - crash mover inadequacy

Tracking

(Not tracked)

Status:

RESOLVED WONTFIX

People

(Reporter: lars, Unassigned)

Details

K Lars Lohn [:lars] [:klohn]

Reporter

Description

•

14 years ago

the crash mover works fine if the primary storage is the file system.  However, it doesn't work well as the system for resubmitting if HBase is the primary storage.  The problem is that the crash mover is long running and meant to go as fast as it can.  As soon as it sees a crash in the fallback storage, it tries to submit it to hbase.  That gives hbase no time to recover from the problem that caused the crash to go into fallback storage.

The hbaseResubmit cron has a built in delay and is much more appropriate for the job.  It is well tested and been used in production for months.  It is my intent to bring it forward from 175 into 176.

K Lars Lohn [:lars] [:klohn]

Reporter

Comment 1

•

14 years ago

this is bogus - I've forgotten that newCrashMover has a backoff retry on the HBase connection.  If hbase is in fail mode, the newCrashMover may, indeed, try immediately to insert, but if it fails, it will back off, holding on to that crash until it can insert it.

we'll revisit this if we encounter problems

Status: NEW → RESOLVED

Closed: 14 years ago

Resolution: --- → WONTFIX

Nobody; OK to take it and work on it

Assignee

Updated

•

13 years ago

Component: Socorro → General

Product: Webtools → Socorro

You need to log in before you can comment on or make changes to this bug.

Bugzilla

Quick Search

Socorro - crash mover inadequacy

Categories

(Socorro :: General, task)

Tracking

(Not tracked)

People

(Reporter: lars, Unassigned)

References

Details

Crash Data

Security

(public)

User Story

Description

Comment 1

Updated