cyrusdb API
Intro
The cyrusdb
API is a common interface to a key-value store, used
throughout the Cyrus code. It allows a choice of different backends for
different access patterns, while ensuring a consistent interface.
This document will describe the interface, and how to use the cyrusdb interface from within parts of Cyrus code, as well as how to implement your own backend
If you pass incorrect values to these APIs, you will get an assertion failure in most cases. That's generally considered safer than silently breaking things. Exceptions are noted below.
Code Layout
The implementation of each interface is in lib/cyrusdb_NAME.c
, for
example lib/cyrusdb_flat.c. General functions are in lib/cyrusdb.c
and the interface in lib/cyrusdb.h
.
Configuration
The name of the backend for each of the main internal databases can be
configured in imapd.conf, for example: annotation_db: skiplist
. This
is then read in imap/global.h and imap/global.c during startup, so that
the global variable config_annotation_db
gets set to the configured
backend name:
config_annotation_db = config_getstring(IMAPOPT_ANNOTATION_DB);
(Beware the misleading naming here: config_annotation_db
is a string
describing the backend used by the database, not, say, its location on
disk.)
Internally, the main module for each database sets up struct of pointers
to the cyrusdb functions it implements, which is registered in
lib/cyrusdb.c
lib/cyrusdb.c
provides backend-agnostic wrapper functions for
interacting with cyrusdb databases.
A full example
struct db *db = NULL;
struct txn *tid = NULL;
const char *filename = NULL;
int flags = CYRUSDB_CREATE;
cyrus_init(alt_config, "toolname", 0);
filename = config_getstring(IMAPOPT_ANNOTATION_DB_PATH);
r = cyrusdb_open(config_annotation_db, filename, flags, &db);
r = cyrusdb_fetch(db, key, keylen, &data, &datalen, &tid);
r = cyrusdb_commit(db, tid);
r = cyrusdb_close(db);
cyrus_done();
Note that you always open a database first, and close it at the end. You
must always call cyrus_init() and cyrus_done() to properly initialize
and clean up the cyrusdb
environments.
This example also uses a transaction, meaning that the database is locked in exclusive mode between the 'fetch' (the first use of the transaction) and the commit.
About Transactions
The cyrusdb interface works in two modes - transactional and non-transactional. The value of the 'tid' parameter decides which mode is used. There are three possible values:
NULL - non-transactional. Will create a temporary lock for the duration of the current action - either a write lock for "store" or a read lock for "fetch". If you call "foreach", the lock will be dropped between each record fetched
Pointer to NULL - transactional, transaction not yet started. Will always take a write lock on the database, and update the pointer to point to the new transaction.
Pointer to a valid transaction. Will keep using this transaction
If you are currently in a transaction, you MUST pass the same transaction to every database call. It is not possible to mix or nest transactions. There is one exception in the skiplist backend: If you pass NULL to a fetch or foreach while the database is in a transaction, it will silently do the read in the current transaction rather than returning an error
API Reference
All functions follow the normal C API of returning '0' on success, and an error code on failure
cyrusdb_init(void)
Is called once per process. Don't call this yourself, use
cyrus_init()
. No other calls will be made until this is called.
cyrusdb_done(void)
The opposite of cyrusdb_init()
- called once per process to do any
cleaning up after all database usage is finished. Don't call this
yourself, use cyrus_done()
.
cyrusdb_sync(const char *backend)
Perform a checkpoint of the database environment. Used by berkeley
backend. Is called by ctl_cyrusdb -c
on a regular basis
cyrusdb_open(const char *backend, const char *fname, int flags, struct db **retdb)
Opens the database with the specified 'file name' (or other descriptor, for example the sql backend is not a filename), and if successful returns an opaque database structure
Flags:
CYRUSDB_CREATE - create the database if it doesn't exist
CYRUSDB_MBOXSORT - sort '.' first, so folder listing is correct
Errors:
CYRUSDB_IOERROR - if there is any error reading the file, or any corruption detected while loading the file
cyrusdb_close(struct db *db)
Close the named database. Will release any locks if they are still held, but it's bad practice to close without committing or aborting, so the backend should log an error
Errors:
CYRUSDB_IOERROR - if there are any errors during close
cyrusdb_fetch(struct db *db, const char *key, size_t keylen, const char **data, size_t *datalen, struct txn **tidptr)
cyrusdb_fetchlock(struct db *db, const char *key, size_t keylen, const char **data, size_t *datalen, struct txn **tidptr)
Fetch the value for the exact key given by key and keylen. If data is not NULL, set datalen and return a valid pointer to the start of the value.
Fetchlock is identical to fetch, but gives a hint to the database that the record is likely to be modified soon.
NOTE: it is possible to store a key with a zero length data record, in which case *datalen will be set to zero, and *data will be set to a non-NULL value
It is an error to call fetch with a NULL key or a zero keylen
It is an error to call fetch with a NULL datalen and a non-NULL data, however it is acceptable to call with a NULL data and a non-NULL datalen if you are only interested in the length
Errors:
CYRUSDB_IOERROR - if any error occurs reading from the database
CYRUSDB_LOCKED - if tidptr is incorrect
CYRUSDB_NOTFOUND - if there is no record that matches the key
cyrusdb_foreach(struct db *db, const char *prefix, size_t prefixlen, foreach_p *goodp, foreach_p *procp, void *rock, struct txn **tidptr)
cyrusdb_forone(struct db *db, const char *key, size_t keylen, foreach_p *goodp, foreach_p *procp, void *rock, struct txn **tidptr)
cyrusdb_foreach()
iterates over all records matching the given
prefix, in database order (which may be MBOXLIST sort, depending on the
parameters given to open
It is legal to give a NULL pointer as prefix if prefixlen is zero, in which case it will return all records in the database. It is an error to give a non-zero prefixlen with a NULL prefix.
cyrusdb_forone()
"iterates" over the single record matched by the
given key. If you've already built callbacks for processing each record
from a foreach, this lets you use the same interface to process a single
record.
goodp
- this function is only used for deciding if the record needs
to be further processed. It can be used for basic filtering, and returns
true (non-zero) to process, or zero to skip and move straight to the
next record. Because goodp can't make any database changes, it doesn't
break the lock, so it's faster to use goodp to filter records if you
don't need to process all of them. NULL is a legal value for goodp, and
means that all records will be processed.
procp
- procp is the main callback function. If you use foreach in
non-transactional mode, the database is unlocked before calling procp,
and locked again afterwards. You are allowed to add, delete or modify
values in the same database from within procp. If procp returns
non-zero, the foreach loop breaks at this point, and the return value of
the foreach becomes the return value of procp. If procp returns zero,
the foreach loop will continue at the NEXT record by sort order,
regardless of whether the current record has changed or been removed.
procp MUST NOT be NULL.
Errors:
procp_result - whatever your callback returns
CYRUSDB_IOERROR - if any error occurs while reading
CYRUSDB_LOCKED - if tidptr is incorrect
cyrusdb_create(struct db *db, const char *key, size_t keylen, const char *data, size_t datalen, struct txn **tidptr)
cyrusdb_store(struct db *db, const char *key, size_t keylen, const char *data, size_t datalen, struct txn **tidptr)
Create a new record or replace an existing one. The only difference
between these two is that cyrusdb_create
will return an error if the
record already exists, while cyrusdb_store
will replace it
If tidptr is NULL, create/store will take a write lock for the duration of the action.
Any failure during create/store will abort the current transaction as well as returning an error
It is legal to pass NULL for the data field ONLY if datalen is zero. It is not legal to pass NULL for key or zero for keylen
Errors:
CYRUSDB_IOERROR - any error to write to the database
CYRUSDB_LOCKED - if tidptr is incorrect
CYRUSDB_EXISTS - if
cyrusdb_create
is called on an existing keyCYRUSDB_AGAIN - if a deadlock is created. The current transaction has been aborted, but a retry may succeed
cyrusdb_delete(struct db *db, const char *key, size_t keylen, struct txn **tidptr, int force)
Delete the given record from the database. If force is true, then succeed even if the record doesn't currently exist.
It is not legal to pass NULL for key or zero for keylen
Errors:
CYRUSDB_IOERROR - any error to write to the database
CYRUSDB_LOCKED - if tidptr is incorrect
CYRUSDB_NOTFOUND - if force is not set and the key doesn't exist
CYRUSDB_AGAIN - if a deadlock is created. The current transaction has been aborted, but a retry may succeed
cyrusdb_commit(struct db *db, struct txn *tid)
Commit the current transaction. tid will not be valid after this call, regardless of success
If the commit fails, it will attempt to abort the transaction
Errors:
CYRUSDB_IOERROR - any error to write to the database
CYRUSDB_LOCKED - if tidptr is incorrect
CYRUSDB_AGAIN - if a deadlock is created. The current transaction has been aborted, but a retry may succeed
cyrusdb_abort(struct db *db, struct txn *tid)
Abort the current transaction. tid will not be valid after this call, regardless of success
Attempt to roll back all changes made in the current transaction.
Errors:
CYRUSDB_IOERROR - any error to write to the database
CYRUSDB_LOCKED - if tidptr is incorrect
cyrusdb_dump(struct db *db, int detail)
Optional function to dump the internal structure of the database to stdout for debugging purposes. Don't use.
cyrusdb_consistent(struct db *db)
Check if the DB is internally consistent. Looks pretty bogus, and isn't used anywhere. Don't use.