Slashdot Mirror


GitLab Says It Found Lost Data On a Staging Server (theregister.co.uk)

GitLab.com, the wannabe GitHub alternative that went down hard earlier this week and reported data loss, has said that some data is gone but that its services are now operational again. From a report The Register: The incident did not result in Git repos disappearing. Which may be why the company's PR reps characterised the lost data as "peripheral metadata that was written during a 6-hour window". But in a prose account of the incident, GitLab says "issues, merge requests, users, comments, snippets, etc" were lost. The Register imagines many developers may not be entirely happy with those data types being considered peripheral to their efforts. GitLab's PR flaks added that the incident impacted "less than 1% of our user base." But the firm's incident log says 707 users have lost data. The startup, which has raised over $25 million, added that it lost six hours of data and asserted that the lost doesn't include users' code.

10 of 101 comments (clear)

  1. PFY ... by PPH · · Score: 3, Funny

    ... couldn't remember the exact database maintenance command sequence. So he called BOFH at home after hours for assistance.

    --
    Have gnu, will travel.
  2. Re:Live by the cloud, by houstonbofh · · Score: 4, Insightful

    The hard part is having a backup plan for your "cloud." Some places make it easy, but some make it VERY hard. Never used gitlab so I can not comment... But if YOU do not have a backup, there are no backups. As Codespaces users found out, and now Gitlab, kinda...

  3. "wannabe GitHub alternative" ? by TheDarkener · · Score: 3, Insightful

    "GitLab.com, the wannabe GitHub alternative" ... Uhm, is that really accurate?

    --
    It is pitch black. You are likely to be eaten by a grue.
    1. Re:"wannabe GitHub alternative" ? by Anonymous Coward · · Score: 5, Informative

      A "github clone" which comes with a CE edition which is FLOSS, and an EE edition, for either zero-cost (CE edition), or just $ (EE edition). And in both cases, you can have your own on-premises. github would be $$$, and I don't think it does on-premises (but even if it does, it is a lot more expensive).

      It is also vastly preferred over github by anyone with small teams. It didn't get into fortune-500 by chance, nor did it get US$ 25M in funding by chance.

      But yes, if you hate github's usability or flows, there is no reason to believe you wouldn't hate gitlab as well. They are *not* the same, but they're close enough.

    2. Re:"wannabe GitHub alternative" ? by rl117 · · Score: 4, Interesting

      GitLab does a bunch of stuff which GitHub doesn't. The most significant for me is the integrated CI, and that you can host your own runners and workspaces on your own infrastructure (or some cloud provider). Compared with Travis or some other CI hook on GitHub, this is vastly more flexible and powerful. I also find the ability to assign people for review, milestones and such on issues and merge requests to be very nice features which GitHub lacks. It is a GitHub clone, but they seem to have taken the lead in implementing more advanced functionality. At work, we're currently looking into a trial of GitLab plus our own multi-platform CI runners as an alternative to GitHub+Travis and internal Jenkins with several hundred jobs. It stands to greatly simplify the amount of failures, admin time and developer time keeping that lot going.

  4. Re:Live by the cloud, by Anonymous Coward · · Score: 5, Informative

    GitLab is actually quite good at it, really.

    1. You can get all the wiki and code repo data by git cloning into a backup repository.
    2. You can set up a remote mirror that gets automatically updated for the code. I don't think you can do that for the wiki, though.
    3. Project admins can download a metadata dump to import in some other gitlab instance (e.g. a local instance of gitlab CE (floss) or EE (paid):
            The following items will be exported:
            Project and wiki repositories
            Project uploads
            Project configuration including web hooks and services
            Issues with comments, merge requests with diffs and comments, labels, milestones, snippets, and other project entities
    4. The data which is not exported (LFS objects, build traces and artifacts, container registry images) can be downloaded in some other way. E.g. LFS is usually cloned along with the git code repos.

    Note that (3) **includes** the webhooks data that was not fully recovered.

    So, yeah, anyone who lost truly important data in this gitlab.com event was actually just as guilty of not following the "Tao of Backup" properly as gitlab.com's sysadmins.

  5. Bad incident; great response by Wuhao · · Score: 5, Interesting

    Obviously, data loss is embarrassing. I think we all appreciate the importance of not only having multiple backups, but testing to ensure that your backups work, and are sufficient to fully restore operations. GitLab is just the latest in a long tradition of sites and services that have found themselves facing the consequences of not regularly testing their recovery plans.

    But I do respect their response. They quickly recognized what had happened, and they diagnosed what went wrong with their backups. They did not try to use PR-speak to conceal their mistake -- they publicly copped to it, in plain industry-standard language that their users would understand, and even offered a livestream of their team resolving the issue. I think this has been a masterclass in how to recover from a blunder. I bet you that this is not a mistake GitLab will be repeating anytime soon.

    Also, I think it's very fortunate that they're in the git repo business, and presumably users who had data that was affected by the loss still have a copy in their own local repos. Thank god for distributed SCM.

  6. Re: Live by the cloud, by twistedcubic · · Score: 3


    Why the hell would you "self-host" a cloud service?

    Almost any server can be "cloud service". There are several interesting solutions to the problem "I need to access a Git repository over the net" in "the cloud" or otherwise. For example, I self host because my code is so amazing, I can't risk having anyone see it lest they die from heart attack due to the overwhelming splendor.

  7. Re: Live by the cloud, by gwolf · · Score: 3, Informative

    Being it a Git repository, you don't have to worry too much about your "centralized" hosting provider – Each developer that has cloned a (non-shallow) repository will locally have everything needed to rebuild history were both providers to disappear. Git is a great backup strategy by itself :-)

  8. Why the axe to grind? by wbr1 · · Score: 5, Insightful
    "wannabe"
    "pr flacks"
    number doubting '"less than 1% of our user base." But the firm's incident log says 707 users have lost data"

    Why the negative tone? I am not a coder. I do not use GitLab or GitHub except for an occasional download. However, generally competition is good. Sure this company lost data.. so do many. The real questions are is this indicative of a systemic issue or just a one time occurrence. I just don't see why this level of negativity is being pushed against this company.

    --
    Silence is a state of mime.