Skip to content

How to use backups to move the external database to Kubernetes

The Operator allows restoring a backup not only on the Kubernetes cluster where it was made, but also on any Kubernetes-based environment with the installed Operator, and the backup/restore tool actually used by the Operator is the Percona Backup for MongoDB . That makes it possible to move external MongoDB Cluster to Kubernetes with Percona Backup for MongoDB.

Note

There are other scenarios for migrating MongoDB database to Kubernetes as well. For example, this blogpost covers migration based on the regular MongoDB replication capabilities.

Backups can be stored either locally, or remotely (on Amazon S3 or S3-compatible storage , or on Azure Blob Storage ). S3-compatible storage to be used for backups.

  1. Make sure the following prerequisite requirements are satisfied within your setup:

    • Percona Backup for MongoDB packages are installed on the replica set nodes of the source cluster following the official installation instructions , and the authentication of the pbm-agent is configured to allow it accessing your database.

    • The Operator and the destination cluster should be installed in the Kuberentes-based environment. For simplicity, it’s reasonable to have the same topology of the source and destination clusters, although Percona Backup for MongoDB allows replset-remapping as well.

  2. Configure the cloud storage for backups on your source cluster following the official guide . For example, using the Amazon S3 storage can be configured with the following YAML file:

    pbm_config.yaml
    type: s3
    s3:
      region: us-west-2
      bucket: pbm-test-bucket
      credentials:
        access-key-id: <your-access-key-id-here>
        secret-access-key: <your-secret-key-here>
    

    After putting all needed details into the file (AWS_ACCESS_KEY_ID, AWS_SECRET_ACCESS_KEY, the S3 bucket and region in the above example), provide the config file to the pbm-agent on all nodes as follows:

    $ pbm config --file pbm_config.yaml
    
  3. Start the pbm-agent:

    $ sudo systemctl start pbm-agent
    
  4. Now you can make backup as follows:

    $ pbm backup --wait
    

    The command output will contain the backup name, which you will further use to restore the backup:

    Starting backup '2022-06-15T08:18:44Z'....
    Waiting for '2022-06-15T08:18:44Z' backup.......... done
    
    pbm-conf> pbm status -s backups
    
    Backups:
    ========
    FS  /data/pbm
      Snapshots:
        2022-06-15T08:18:44Z 28.23KB <logical> [complete: 2022-06-15T08:18:49Z]
    
  5. The rest of operations will be carried out on your destination cluster in a Kubernetes-based environment of your choice. These actions are described in the How to restore backup to a new Kubernetes-based environment guide. Just use the proper name of the backup (2022-06-15T08:18:44Z) in the above example, and proper parameters specific to your cloud storage (e.g. the pbm-test-bucket bucket name we used above).


Last update: 2024-09-15