InnoDB IBD File Parser

A Python tool for parsing and analyzing InnoDB .ibd files. This tool helps database administrators and developers understand the internal structure of InnoDB tablespace files.

Features

Parse InnoDB page headers
Analyze index pages
Extract record contents
Support for various page types:
- Index pages (B-tree nodes)
- File space header
- XDES (Extent descriptor)
- BLOB pages
- And more...

Installation

From PyPI:

pip install ibd-parser

From source:

pip install git+https://github.com/likeyiyy/ibd-parser.git

Usage

Command Line Interface

# Show page header
ibd-parser -f /path/to/table.ibd header --page 4

# Dump records from an index page
ibd-parser -f /path/to/table.ibd records --page 4

Python API

from ibd_parser import IBDFileParser

# Initialize parser with your .ibd file
parser = IBDFileParser("/path/to/your/table.ibd")

# Analyze a specific page
page_info = parser.analyze_page(page_no=4)

# Access page information
print(f"Page Type: {page_info['header'].page_type}")
if 'index_header' in page_info:
    print(f"Number of records: {page_info['index_header'].n_recs}")

# Get records from an index page
records = parser.get_records(page_no=4)
for record in records:
    print(record.data)

# Hex dump of a page
parser.hex_dump(page_no=4, length=128)

Project Structure

ibd_parser/
├── ibd_parser/
│   ├── __init__.py
│   ├── constants.py    # Constants and enums
│   ├── page.py        # Page structure definitions
│   ├── record.py      # Record parsing
│   ├── parser.py      # Main parser implementation
│   ├── cli.py         # Command line interface
│   └── utils.py       # Utility functions
├── tests/
├── README.md
└── pyproject.toml     # Project metadata and dependencies

Page Structure

An InnoDB page (default 16KB) consists of:

File Header (38 bytes)
Page Header (56 bytes)
Infimum/Supremum Records
User Records
Free Space
Page Directory
File Trailer (8 bytes)

Contributing

Contributions are welcome! Please feel free to su 7590 bmit a Pull Request.

Fork the repository
Create your feature branch (git checkout -b feature/AmazingFeature)
Commit your changes (git commit -m 'Add some AmazingFeature')
Push to the branch (git push origin feature/AmazingFeature)
Open a Pull Request

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

Based on the InnoDB storage engine documentation
Inspired by various InnoDB internals research papers and blog posts

References

Future Plans

I will continue to enhance this project to make it more practical and valuable. Planned improvements include:

Support for more page types
Enhanced record analysis
Data recovery features
More comprehensive CLI tools
Better documentation and examples

Contributions and suggestions are welcome!

Testing and Development

Creating Test Data

The project includes a script to create a test database with various column types:

# Install MySQL Connector
pip install mysql-connector-python

# Set MySQL connection environment variables (if needed)
export MYSQL_USER=your_user
export MYSQL_PASSWORD=your_password
export MYSQL_HOST=localhost
export MYSQL_PORT=3306  # Or your Docker mapped port

# Create test database and table
python examples/create_test_data.py

The test table includes common MySQL data types:

Integer types (TINYINT, SMALLINT, INT, BIGINT)
Floating point types (FLOAT, DOUBLE, DECIMAL)
String types (CHAR, VARCHAR, TEXT)
Date and Time types (DATE, TIME, DATETIME, TIMESTAMP)
Other types (BOOLEAN, ENUM, BINARY, BLOB)

After creating the test database, you can find the .ibd file at:

/data/docker/mysql/data/ibd_parser_test/test_table.ibd

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
.github/workflows		.github/workflows
examples		examples
ibd_parser		ibd_parser
.bumpversion.cfg		.bumpversion.cfg
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

InnoDB IBD File Parser

Features

Installation

Usage

Command Line Interface

Python API

Project Structure

Page Structure

Contributing

License

Acknowledgments

References

Future Plans

Testing and Development

Creating Test Data

About

Uh oh!

Releases

Packages

Languages

License

zr-hebo/ibd-parser

Folders and files

Latest commit

History

Repository files navigation

InnoDB IBD File Parser

Features

Installation

Usage

Command Line Interface

Python API

Project Structure

Page Structure

Contributing

License

Acknowledgments

References

Future Plans

Testing and Development

Creating Test Data

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages