Bug 1529012 Comment 7 Edit History

Note: The actual edited comment in the bug view page will always show the original commenter’s name and original timestamp.

I [briefly investigated the correctness of `first_use_date`](https://colab.research.google.com/drive/1GEj9vedTOqz85OY2UCntfO8fWR4VJDLq?usp=sharing). One of my main findings is that a users' `min(first_use_date)` is equal to their `min(creation_date)` [1] for ~98.5% of users who submit those values over a recent two week period. Since I was not able to fully understand what the bugs were in `creation_date` (bug 1449739), I'm not sure what conclusion to draw from this: on recent data, it seems likely that they either both have the bugs or neither of them do. `first_use_date` otherwise looked fairly sensible.

I wonder if the bugged `creation_date` values came primarily from the method to backfill a creation date based on the files in the profile and that we have proportionally fewer of these clients on recent data such that `first_use_date` and `creation_date` look very similar.

Disclaimer: I am not a data scientist, this analysis was not reviewed, and it was brief.
_(cross-post [to JIRA](https://mozilla-hub.atlassian.net/browse/DS-107?focusedCommentId=568958))_

I [briefly investigated the correctness of `first_use_date`](https://colab.research.google.com/drive/1GEj9vedTOqz85OY2UCntfO8fWR4VJDLq?usp=sharing). One of my main findings is that a users' `min(first_use_date)` is equal to their `min(creation_date)` [1] for ~98.5% of users who submit those values over a recent two week period. Since I was not able to fully understand what the bugs were in `creation_date` (bug 1449739), I'm not sure what conclusion to draw from this: on recent data, it seems likely that they either both have the bugs or neither of them do. `first_use_date` otherwise looked fairly sensible.

I wonder if the bugged `creation_date` values came primarily from the method to backfill a creation date based on the files in the profile and that we have proportionally fewer of these clients on recent data such that `first_use_date` and `creation_date` look very similar.

Disclaimer: I am not a data scientist, this analysis was not reviewed, and it was brief.

Back to Bug 1529012 Comment 7