API Doc

Index of APIs

Init Catalog
Load Catalog
Create Database
Show Databases
Select Database
Show Tables
Save Catalog
Create Table
Init Table
Init Page
Page Count
Create Page
Read Page
Write Page
Page Free Space
Add Tuple to Page
Read Item / Get Tuple

API Descriptions

0. init_catalog API

Description:
Creates catalog.json if it doesn’t exist and initializes it with an empty catalog

{"databases": {}}

Function:

pub fn init_catalog()

Implementation:

Check if database/global/catalog.json exists.
If not, create parent directories and an empty file with data json{"databases": {}}.

1. load_catalog API

Description:

Loads the catalog table metadata into memory as a Catalog struct.

Function:

pub fn load_catalog() -> Catalog

Ouput:

Returns a valid Catalog struct containing table metadata.

Implementation:

Reads the catalog file, validates its contents, and deserializes it into a Catalog struct.

2. create_database API

Description:

Creates a new database in the catalog.

Function:

pub fn create_database(catalog: &mut Catalog, db_name: &str) -> Result<(), StorageError>

Input:

catalog: in-memory catalog metadata.
db_name: Name of the new database to be created.

Ouput:

Returns Ok on success or an error if creation fails.

Implementation:

Check if the database already exists; if not, insert a new empty entry into catalog.databases.
Serialize the updated Catalog and write it to database/global/catalog.json.
Create a new directory at database/base/{db_name} for the database’s physical storage.

2. save_catalog API

Description:

Writes the in-memory Catalog structure (containing table metadata) back to disk in JSON format.

Function:

pub fn save_catalog(catalog: &Catalog)

Input:

catalog: A reference to a Catalog struct in memory that holds all table definitions.

Ouput:

Writes the catalog data to CATALOG_FILE.

Implementation:

Serializes the given Catalog struct into a JSON string and writes it to the catalog file path.

3. create_table API

Description:

Creates a new table entry in the in-memory Catalog with the provided table name and column definitions.
If the table does not already exist, it is added to the catalog and the updated catalog is written to disk in JSON format.

Function:

pub fn create_table(catalog: &mut Catalog, table_name: &str, columns: Vec<Column>) 

Input:

catalog: A mutable reference to the in-memory Catalog structure that holds metadata for all tables.
table_name: A string slice representing the name of the new table to be created.
columns: A vector of Column structs, where each struct contains the column name and data type for the new table.

Ouput:

Updates the in-memory Catalog by inserting the new table.
Persists the updated catalog to disk by writing to the CATALOG_FILE in JSON format using save_catalog API.

Implementation:

Checks if a table with the given name already exists in the catalog.
If not, creates a new Table struct using the provided columns.
Inserts the table into the catalog.tables HashMap.
Calls save_catalog(catalog) to serialize and write the updated catalog to disk.
Creates a new data file for the table in {TABLE_DIR}/{table_name}.dat.
Initializes the table file header by writing TABLE_HEADER_SIZE bytes of zeros using init_table().

4. init_table API

Description:

Initializes the Table Header by writing the first page (8192 bytes) into the table file with 0's. The first 4 bytes represent the Page Count (1).

Function:

pub fn init_table(file: &mut File)

Input: file: File pointer to update Table Header.

Output: Table header (first page) initialized with page_count = 1 in the first 4 bytes and remaining bytes set to zero.

Implementation:

Move the file cursor to the beginning of the file.
Allocate a buffer of 8192 bytes (TABLE_HEADER_SIZE) initialized to zero.
Write the entire 8192-byte buffer (including the page count) to disk, marking the creation of the first table page.
Write another 8192-byte buffer to disk to initialize the first data page along with page headers using create_page API (Page 1), which will store table tuples.

5. init_page API

Description:

Initializes the Page Header with two offset values for In Memory Page:
- Lower Offset (PAGE_HEADER_SIZE) → bytes 0..4
- Upper Offset (PAGE_SIZE) → bytes 4..8

Function:

pub fn init_page(page:&mut Page)

Input: page: In Memory Page to set Header - Lower and Upper Offsets.

Output: Page header updated with lower and upper offsets.

Implementation:

Write the lower offset (PAGE_HEADER_SIZE) into the first 4 bytes of the page header (0..4).
Write the upper offset (PAGE_SIZE) into the next 4 bytes of the page header (4..8).

6.page_count API

Description: To get total number of pages in a file

Function:

pub fn page_count(file: &mut File)

Input: file: file to calculate number of pages.

Output: Total number of pages present in the file.

Implementation:

Use the read_page() function to read the first page (page ID 0) from the file into memory.
Extract the first 4 bytes from the in-memory page buffer — these bytes represent the page count stored in the table header.
Return the first 4 bytes as page count.

7. create_page API

Description:
Create a page in disk for a file.

Function:

pub fn create_page(file: &mut File)

Input: file: file to create to a file

Output:

Create a page at the end of the file.
Update the File Header with Page Count.

Implementation:

Initializes a new page in memory using init_page API (update page header - lower and upper).
Reads the current page count from the file using the page_count API.
Moves the file cursor to the end of the file.
Writes the initialized in-memory page to the file and updates the file header by incrementing the page count stored in the first 4 bytes.

8. read_page API

Description:
Reads a page from a disk/file into memory.

Function:

pub fn read_page(file: &mut File, page: &mut Page, page_num: u32)

Input: file: file to read from, page: memory page to fill, page_num: page number to read

Output: Populates the given memory page with data read from the file.

Implementation:

Calculates the offset as (page_num * PAGE_SIZE) and moves the file cursor to the correct position.
Reads data from that offset position up to offset + PAGE_SIZE and copies it into the page memory.

Cases Handled:

Checks the file size and returns an error if the requested page does not exist in the file.

9.write_page API

Description:
Write a page from memory to disk/file.

Function:

pub fn write_page(file: &mut File, page: &mut Page, page_num: u32)

Input: file: file to write, page: memory page to copy from, page_num: page number to write

Output: Writes the contents of the given memory page to the file at the specified page offset.

Implementation:

Calculates the offset as page_num * PAGE_SIZE and moves the file cursor to the correct position.
copy the contents of the given memory page from offset to offset + PAGE_SIZE positions to the file.

10. page_free_space API

Description:
To calculate the total amount of free space left in the page.

Function:

pub fn page_free_space(page: &Page) 

Input: page: page to calculate the free space.

Output: Total amount of freespace left in the page.

Implementation:

Read the lower pointer from the first 4 bytes of the page.
Read the upper pointer from the next 4 bytes of the page.
Calculate free space = upper - lower.
Return the free space.

11. Add Tuple

Description: Adds raw data to the file.

Function:

In the buffer manager load_csv_into_pages function.

Output: Data inserted in the file.

Implementation:

Get the total number of pages in the file using page_count API.
Read the last page into memory using read_page API.
Check free space in the page using page_free_space API.
If the last page has enough free space to store the data and its ItemId (i.e., if free_space >= data.size() + ITEM_ID_SIZE): a. Calculate the insertion offset from the upper pointer. start = upper - data.len() b. Copy the data bytes into the page buffer starting at this offset. c. Update the upper pointer in the page header to the new start of free space. d. Write the ItemId entry (offset and length of the data) at the position indicated by the lower pointer. e. Update the lower pointer in the page header to account for the newly added ItemId (lower += ITEM_ID_SIZE). f. Write the updated page back to disk using write_page API.
If the last page does not have enough free space: a. Create a new page in the file and add the tuple in the new page.

Code Documentation
Reference: Postgres Internals – Page Layouts & Data

Note: Some APIs have undergone slight implementation changes during development. So, some of the implementation steps might not aligned with the actual code functions.

Index of APIs​

API Descriptions​

0. init_catalog API​

1. load_catalog API​

2. create_database API​

2. save_catalog API​

3. create_table API​

4. init_table API​

5. init_page API​

6.page_count API​

7. create_page API​

8. read_page API​

9.write_page API​

10. page_free_space API​

11. Add Tuple​

Index of APIs

API Descriptions

0. init_catalog API

1. load_catalog API

2. create_database API

2. save_catalog API

3. create_table API

4. init_table API

5. init_page API

6.page_count API

7. create_page API

8. read_page API

9.write_page API

10. page_free_space API

11. Add Tuple