Classes
struct	clod_region_opts

Macros
#define	CLOD_REGION_VERSION 1

Enumerations
enum	clod_region_result { CLOD_REGION_OK = 0 , CLOD_REGION_INVALID_USAGE = 1 , CLOD_REGION_MALFORMED = 2 , CLOD_REGION_NOT_FOUND = 3 }

Functions
struct clod_region *	clod_region_open (const char *path, const struct clod_region_opts *opts)
enum clod_region_result	clod_region_read (struct clod_region *region, const int64_t *pos, uint8_t *buff, size_t buff_size, size_t *size)
enum clod_region_result	clod_region_write (struct clod_region *region, const int64_t *pos, const uint8_t *buff, size_t buff_size)
enum clod_region_result	clod_region_mtime (struct clod_region *region, const int64_t *pos, time_t *mtime)
struct clod_region_iter *	clod_region_iter_start (struct clod_region *region)
bool	clod_region_iter_next (struct clod_region_iter *iter, int64_t *pos)
void	clod_region_iter_end (struct clod_region_iter *iter)
enum clod_region_result	clod_region_close (struct clod_region *region)

Open modes
#define	CLOD_REGION_MODE_RDONLY 1
#define	CLOD_REGION_MODE_RDWR 2

Limits
#define	CLOD_REGION_PREFIX_MAX 30
#define	CLOD_REGION_EXTENSION_MAX 10
#define	CLOD_REGION_DIMENSIONS_MAX 10

Detailed Description

region.h defines the public interface that libclod exposes for interacting with a gentrified and extended version of the Minecraft region file format that retains compatibility with Minecraft.

https://minecraft.wiki/w/Region_file_format

libclod extends the region file format with a header supporting dynamic features and uses that support to add checksums and uncompressed chunk size to

Thanks to ishland for his insights while brainstorming approaches to this.

Region File Format

Libclod supports three region file header variants: the vanilla header, the libclod header, and a backwards-compatible combination of both (the compound header). The vanilla header is supported only for reading, with reduced functionality (e.g. no data validation or crash recovery), because of limitations in the format itself. The libclod header is not backwards compatible. The compound header includes both headers and is backwards compatible.

Vanilla

The vanilla header does not have any dynamic storage capability. It is a fixed size header and chunk data begins directly after.

Header

Offset	Size	Description
0	4096	Chunk locations [1024]
4096	4096	Modification time in unix epoch seconds [1024]
8192	...	Chunk data

Chunk location

Offset	Size	Description
0	3	Offset in sectors
3	1	Size in sectors
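
As a rough illustration of the layout above, the sketch below decodes one 4-byte chunk location entry and converts its sector offset to a byte offset. The struct and function names are hypothetical and not part of region.h. It assumes the 3-byte offset is big-endian (as in vanilla Minecraft region files) and that sectors are 4096 bytes.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical type for illustration only; not part of region.h. */
struct chunk_location {
    uint32_t sector_offset; /* offset from the start of the file, in sectors */
    uint8_t  sector_count;  /* size of the chunk, in sectors */
};

/* Decode one 4-byte chunk location entry.
 * Assumption: the 3-byte offset is stored big-endian. */
static struct chunk_location decode_chunk_location(const uint8_t entry[4])
{
    assert(entry != NULL);
    struct chunk_location loc;
    loc.sector_offset = ((uint32_t)entry[0] << 16)
                      | ((uint32_t)entry[1] << 8)
                      |  (uint32_t)entry[2];
    loc.sector_count = entry[3];
    return loc;
}

/* Convert a sector offset to a byte offset.
 * Assumption: 4096-byte sectors. */
static uint64_t sector_to_byte_offset(uint32_t sector_offset)
{
    return (uint64_t)sector_offset * 4096u;
}
```

With these assumptions, an entry of 00 00 02 01 describes a one-sector chunk starting at byte 8192, which is the first position available for chunk data after the vanilla header.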

Chunk data

Offset	Size	Description
0	4	Compressed chunk size in bytes
4	1	Chunk compression type
5	...	Compressed chunk data

Libclod

The libclod region file format uses NBT to store header data. As a result, the static header is smaller than the vanilla one (although intentionally oversized) and stores only the metadata and other fields that need static storage.

The static header is padded to 256 bytes, leaving some space for any future extensions that might need static storage such as shared mutexes.

All CRC-32 values use the polynomial 0x04C11DB7, reflect input and output, use 0xFFFFFFFF as the initial value, and xor with 0xFFFFFFFF to finalise. AFAICT this is the most common CRC-32 variant, so an implementation should always be close at hand.
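
For reference, a minimal bitwise sketch of the CRC-32 variant described above. The function name is hypothetical and this is not necessarily how libclod computes it; a table-driven or hardware-accelerated implementation would produce the same values. The constant 0xEDB88320 is the polynomial 0x04C11DB7 with its bits reversed, which is what the reflected form of the algorithm uses.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* CRC-32 with polynomial 0x04C11DB7, reflected input and output,
 * initial value 0xFFFFFFFF and final xor 0xFFFFFFFF.
 * Hypothetical helper name; not part of region.h. */
static uint32_t crc32_compute(const uint8_t *data, size_t len)
{
    assert(data != NULL || len == 0);
    uint32_t crc = 0xFFFFFFFFu;              /* initial value */
    for (size_t i = 0; i < len; i++) {
        crc ^= data[i];                      /* reflected: feed the low byte */
        for (int bit = 0; bit < 8; bit++) {
            if (crc & 1u)
                crc = (crc >> 1) ^ 0xEDB88320u; /* bit-reversed polynomial */
            else
                crc >>= 1;
        }
    }
    return crc ^ 0xFFFFFFFFu;                /* final xor */
}
```

The standard check value for this variant is 0xCBF43926 for the nine ASCII bytes "123456789", which is a quick way to confirm that another implementation matches.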

Header

Offset	Size	Description
0	128	Human readable magic used to identify the file
128	4	CRC-32 checksum of NBT data
132	4	Size of NBT data in bytes
136	4	Generation number incremented at the start and end of every write
256	...	NBT data
...	...	Chunk data
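
The fixed offsets in this table can be read with a few lines of code. The sketch below is illustrative only: the names are hypothetical, the byte order of the integer fields is not stated here and is assumed to be big-endian (matching Java-edition NBT), and the "odd generation means a write was started but not finished" check assumes the generation number starts out even.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Offsets taken from the header table; names are hypothetical. */
enum {
    HDR_CRC_OFFSET        = 128,
    HDR_NBT_SIZE_OFFSET   = 132,
    HDR_GENERATION_OFFSET = 136,
    HDR_STATIC_SIZE       = 256
};

struct static_header {
    uint32_t nbt_crc32;  /* CRC-32 checksum of the NBT data */
    uint32_t nbt_size;   /* size of the NBT data in bytes */
    uint32_t generation; /* bumped at the start and end of every write */
};

/* Assumption: integer fields are big-endian. */
static uint32_t read_u32_be(const uint8_t *p)
{
    return ((uint32_t)p[0] << 24) | ((uint32_t)p[1] << 16)
         | ((uint32_t)p[2] << 8)  |  (uint32_t)p[3];
}

/* Parse the numeric fields of the 256-byte static header. */
static struct static_header parse_static_header(const uint8_t raw[HDR_STATIC_SIZE])
{
    assert(raw != NULL);
    struct static_header h;
    h.nbt_crc32  = read_u32_be(raw + HDR_CRC_OFFSET);
    h.nbt_size   = read_u32_be(raw + HDR_NBT_SIZE_OFFSET);
    h.generation = read_u32_be(raw + HDR_GENERATION_OFFSET);
    return h;
}

/* Assumption: the generation starts even, so an odd value means a write
 * began (first increment) but never reached its end (second increment). */
static int write_in_progress(const struct static_header *h)
{
    assert(h != NULL);
    return (h->generation & 1u) != 0;
}
```

A reader following this approach would read nbt_size bytes starting at offset 256, compute their CRC-32, and compare the result against nbt_crc32 before trusting the NBT data.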

NBT data

The idea behind the NBT structure is that implementations can store whatever they need to. Data not relevant to a given implementation is simply ignored by it, and there's backwards compatible extensibility for new features.

There are a couple of limitations to NBT data, most of which stem from the fact that a write can cause other tags to move in memory. Since dynamic storage is the goal, the alternative to copying is some kind of memory allocation scheme, which would have to deal with issues like fragmentation and backwards compatibility, and would require a complex structure to organise it all. By the end of all that you probably won't even be any faster than NBT's dumb copying anyway. Modern CPUs copy at tens of GB/s on a bad day. You'll just have an overengineered file format for the sake of being able to cache pointers a little more easily.

Besides, when it comes to dynamic storage formats, we already have a reliable and well-understood in-domain format. It doesn't make sense to use something else without a concrete reason.

Root Tag

Key	Type	Description
ChunkFilePrefix	String	Prefix chunk filenames have (e.g. "c")
ChunkFileExtension	String	Extension chunk filenames have (e.g. "mcc")
Dimensions	Byte	Number of dimensions in the region file
SectorSize	Int	Sector size
Chunks	Compound	Chunk metadata

Chunks Tag

Key	Type	Description
ModificationTime	Int Array [1024]	Chunk last modification time in unix epoch seconds
FileOffset	Int Array [1024]	Location of the chunk data in the file
FileSectors	Byte Array [1024]	Chunk size in sectors
Checksum	Int Array [1024]	Checksum of chunk data
UncompressedSize	Int Array [1024]	Uncompressed chunk size in bytes

Compound

The compound format aims to provide backwards compatibility with the vanilla format. It is simply the vanilla header and the libclod header concatenated, with the Chunks/ModificationTime, Chunks/FileOffset and Chunks/FileSectors arrays omitted from the libclod header (the vanilla header already stores that information) and chunks stored at a larger offset to make space.

Offset	Size	Type	Description
0	8192	Vanilla header	The vanilla header
8192	...	Libclod header	The libclod header

Implementations that don't support the libclod format will see a perfectly valid and correct vanilla header and function normally; however, it's likely they will overwrite the libclod header with chunk data when writing. Libclod will see a failing magic or checksum and fall back to treating the file as a pure vanilla region file, and information in the libclod header is lost.

Notably, to facilitate backwards compatibility, some attributes must be fixed. When opening a compound header, libclod will ignore existing values and set them to the following. If the implementation uses different values for these, then backwards compatibility is broken, and the libclod header should be used instead.

Key	Value
ChunkFilePrefix	"c"
ChunkFileExtension	"mcc"
Dimensions	2
SectorSize	4096

Macro Definition Documentation

◆ CLOD_REGION_VERSION

#define CLOD_REGION_VERSION 1

Library ABI version. Backwards compatibility with old ABI versions and behaviours will be ensured, at least until a major SO version bump. This macro identifies which ABI version the program is expecting.

Definition at line 30 of file region.h.

◆ CLOD_REGION_MODE_RDONLY

#define CLOD_REGION_MODE_RDONLY 1

Definition at line 153 of file region.h.

◆ CLOD_REGION_MODE_RDWR

#define CLOD_REGION_MODE_RDWR 2

Definition at line 154 of file region.h.

◆ CLOD_REGION_PREFIX_MAX

#define CLOD_REGION_PREFIX_MAX 30

Definition at line 159 of file region.h.

◆ CLOD_REGION_EXTENSION_MAX

#define CLOD_REGION_EXTENSION_MAX 10

Definition at line 160 of file region.h.

◆ CLOD_REGION_DIMENSIONS_MAX

#define CLOD_REGION_DIMENSIONS_MAX 10

Definition at line 161 of file region.h.

Enumeration Type Documentation

◆ clod_region_result

enum clod_region_result

Result of a call to a libregion library method. It is designed to let programs respond to error states and is not intended to carry debug information. The library currently uses stderr for debug information and user-relevant messages (e.g. permission denied). I do realise that might not be ideal for some cases, so I've tried to leave space to add support for a custom logger if such a thing becomes wanted.

Enumerator
CLOD_REGION_OK	No worries mate.
CLOD_REGION_INVALID_USAGE	Library used incorrectly - either directly or indirectly. File permissions errors, IO errors, system clock errors, virtual memory errors, memory allocation failures, invalid options, and many others all fall under this value.
CLOD_REGION_MALFORMED	Data is corrupted. Manual intervention is required. The program can delete the chunk to continue operation. Note: deleting a corrupted chunk can cause the deletion of other corrupted chunks.
CLOD_REGION_NOT_FOUND	The chunk does not exist. The program can write to the chunk to make it exist.

Definition at line 43 of file region.h.
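
One way a caller might act on these result codes is sketched below. The enum is redeclared only so that the example compiles on its own; real code would include region.h instead. The recovery_action type and suggested_action function are hypothetical and simply restate the guidance in the table above. They are not part of the library.

```c
#include <assert.h>

/* Redeclared from the documentation so this sketch is self-contained;
 * real code would get this from region.h. */
enum clod_region_result {
    CLOD_REGION_OK = 0,
    CLOD_REGION_INVALID_USAGE = 1,
    CLOD_REGION_MALFORMED = 2,
    CLOD_REGION_NOT_FOUND = 3
};

/* The numeric values are documented, so check the sketch agrees with them. */
static_assert(CLOD_REGION_OK == 0, "CLOD_REGION_OK is documented as 0");

/* Hypothetical caller-side policy; not part of region.h. */
enum recovery_action {
    ACTION_NONE,          /* nothing to do */
    ACTION_FIX_USAGE,     /* caller or environment problem; check stderr */
    ACTION_DELETE_CHUNK,  /* corrupted: the program can delete the chunk */
    ACTION_WRITE_CHUNK    /* missing: the program can write the chunk */
};

static enum recovery_action suggested_action(enum clod_region_result result)
{
    switch (result) {
    case CLOD_REGION_OK:            return ACTION_NONE;
    case CLOD_REGION_INVALID_USAGE: return ACTION_FIX_USAGE;
    case CLOD_REGION_MALFORMED:     return ACTION_DELETE_CHUNK;
    case CLOD_REGION_NOT_FOUND:     return ACTION_WRITE_CHUNK;
    }
    /* Unknown value, e.g. from a newer library version: treat as usage error. */
    return ACTION_FIX_USAGE;
}
```

Whether deleting a malformed chunk is acceptable is an application decision, since deleting one corrupted chunk can cause other corrupted chunks to be deleted as well.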

Function Documentation

◆ clod_region_open()

struct clod_region * clod_region_open	(	const char *	path,
		const struct clod_region_opts *	opts )

Open a directory for region storage.

Parameters

path	Path to the directory.
opts	Configuration options.

Returns: Handle to the directory and configuration, or nullptr on error.

Definition at line 98 of file region_open.c.

◆ clod_region_read()

enum clod_region_result clod_region_read	(	struct clod_region *	region,
		const int64_t *	pos,
		uint8_t *	buff,
		size_t	buff_size,
		size_t *	size )

Read chunk data.

Parameters

[in]	region	Region handle.
[in]	pos	Chunk position.
[out]	buff	The buffer where data is written to.
[in]	buff_size	Size of buff.
[out]	size	Actual size of the chunk data.

Return values

CLOD_REGION_OK	On success.
CLOD_REGION_INVALID_USAGE	On invalid usage.
CLOD_REGION_MALFORMED	Chunk data is corrupted.
CLOD_REGION_NOT_FOUND	Chunk does not exist.

Definition at line 6 of file region_read.c.

◆ clod_region_write()

enum clod_region_result clod_region_write	(	struct clod_region *	region,
		const int64_t *	pos,
		const uint8_t *	buff,
		size_t	buff_size )

Write chunk data or delete a chunk. Delete a chunk by passing a null buffer.

Parameters

[in]	region	Region handle.
[in]	pos	Chunk position.
[in]	buff	Buffer containing chunk data to write.
[in]	buff_size	Size of the chunk data to write.

Return values

CLOD_REGION_OK	On success.
CLOD_REGION_INVALID_USAGE	On invalid usage.
CLOD_REGION_MALFORMED	Chunk data is corrupted.

◆ clod_region_mtime()

enum clod_region_result clod_region_mtime	(	struct clod_region *	region,
		const int64_t *	pos,
		time_t *	mtime )

Get the last modification time of the chunk.

Parameters

[in]	region	Region handle.
[in]	pos	Chunk position.
[out]	mtime	Last modification time.

Return values

CLOD_REGION_OK	On success.
CLOD_REGION_INVALID_USAGE	On invalid usage.
CLOD_REGION_MALFORMED	Chunk data is corrupted.
CLOD_REGION_NOT_FOUND	Chunk does not exist.

◆ clod_region_iter_start()

struct clod_region_iter * clod_region_iter_start ( struct clod_region * region )

Start iterating over chunks.

Parameters

[in] region Region handle.

Returns: New iterator, or nullptr on allocation failure.

◆ clod_region_iter_next()

bool clod_region_iter_next	(	struct clod_region_iter *	iter,
		int64_t *	pos )

Get the next position when iterating over chunks. This method is thread-safe and always returns a unique position.

Parameters

[in]	iter	Iterator.
[out]	pos	Next chunk position.

Returns: True if the next position existed and was returned in pos.

◆ clod_region_iter_end()

void clod_region_iter_end ( struct clod_region_iter * iter )

Release resources associated with an iterator. This method is not thread safe.

Parameters

[in] iter The iterator to free.

◆ clod_region_close()

enum clod_region_result clod_region_close ( struct clod_region * region )

Release resources associated with the region handle.

Parameters

[in] region The handle to free.

Definition at line 121 of file region_open.c.
