|
Because file identification cookies (for example, filenames, device and inode numbers, volume and file IDs, and so on) are not necessarily unique or maintained across system reboots, each Berkeley DB database file contains a 20-byte file identification bytestring that is stored in the first page of the database, starting with the 53rd byte on the page. When multiple processes or threads open the same database file in Berkeley DB, it is this bytestring that is used to ensure that the same underlying pages are updated in the shared memory buffer pool, no matter which Berkeley DB handle is used for the operation.
It is usually a bad idea to physically copy a database to a new name. In the few cases in which copying is the best solution for your application, you must guarantee that there are never two different databases with the same file identification bytestring in the memory pool at the same time. Copying databases is further complicated by the fact that the shared memory buffer pool does not discard all cached copies of pages for a database when the database is logically closed; that is, when DB->close is called. Nor is there a Berkeley DB interface to explicitly discard pages from the shared memory buffer pool for any particular database.
Before copying a database, you must ensure that all modified pages have been written from the memory pool cache to the backing database file. This is done using the DB->sync or DB->close interfaces.
Before using a copy of a database from Berkeley DB, you must ensure that all pages from any database with the same bytestring have been removed from the memory pool cache. If the environment in which you intend to open the copy of the database potentially has pages from files with identical bytestrings to the copied database (which is likely to be the case), there are a few possible solutions: