Gap-Free Transaction Tracker

Problem Overview

The Gap-Free Transaction Tracker is designed for workloads of billions of transactions that are completed by many concurrent threads. Transactions start consecutively but can complete out of order, so the tracker maintains the highest gap-free transaction ID: the largest ID for which that transaction and every transaction before it have completed.

Key Problem Characteristics:

Consecutive Start, Out-of-Order Completion: Transactions start in a sequential order but can complete in any order.
Concurrency: Multiple threads (hundreds) may complete transactions concurrently, so the solution must efficiently handle concurrent updates.
Gap-Free Completion: The goal is to identify the highest completed transaction where all previous transactions have been completed.
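
To make the gap-free rule concrete, here is a minimal single-threaded reference model. It is only an illustration of the semantics, not the repository's implementation: the class name GapFreeModel, the method names, and the assumption that IDs start at 0 (with -1 meaning "nothing completed yet") are all hypothetical.

```cpp
#include <set>

// Single-threaded reference model of the gap-free rule (illustrative only).
class GapFreeModel {
public:
    // Records that transaction `id` finished, then advances the watermark
    // over every consecutive ID that has already completed.
    void complete(long id) {
        if (id <= highest_) return;  // already covered by the watermark
        pending_.insert(id);
        while (!pending_.empty() && *pending_.begin() == highest_ + 1) {
            pending_.erase(pending_.begin());
            ++highest_;
        }
    }

    // Highest ID such that every ID in [0, highest] has completed; -1 if none.
    long highestGapFree() const { return highest_; }

private:
    std::set<long> pending_;  // completed IDs still waiting behind a gap
    long highest_ = -1;       // assumption: transaction IDs start at 0
};
```

For example, after completing IDs 0, 1, and 3 the model reports 1, because 2 is still outstanding. Once 2 completes, the value jumps straight to 3.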

Solution

This implementation combines atomic operations with a mutex to keep access to shared state thread-safe. The main components of the solution are:

Atomic Operations: An atomic<long> is used to track the highest gap-free transaction ID to ensure thread-safe concurrent access.
Completion Tracker: A vector of atomic<bool> values tracks which transactions have been completed. Each index in the vector corresponds to a transaction ID, and a true value at a given index signifies the completion of that transaction.
Mutex Lock: A mutex protects the critical section when updating the highest gap-free transaction ID, ensuring that only one thread can update it at a time.

Solution Walkthrough:

Transaction Completion:
The function transactionCompleted(long id) marks a transaction as complete and updates the status in a vector of atomic booleans. Then, a mutex ensures thread-safe updates to the highest gap-free transaction ID.
Concurrency Handling:
Atomic operations (store, load, fetch_add) ensure that marking transactions and checking their status is fast and thread-safe. A mutex is only used briefly when the highest gap-free transaction needs updating, reducing contention between threads.
Performance:
Using a vector of atomic<bool> ensures that marking and checking transactions are O(1) operations. The use of a mutex is minimized to prevent bottlenecks in a highly concurrent environment.
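
The walkthrough above can be sketched roughly as follows. This is a simplified illustration under stated assumptions, not a copy of GapFreeTrackerImpl: the class name TrackerSketch, the accessor highestGapFree(), the fixed capacity passed to the constructor, and IDs starting at 0 are all assumptions made for the example.

```cpp
#include <atomic>
#include <cstddef>
#include <mutex>
#include <vector>

// Illustrative sketch only; not the repository's GapFreeTrackerImpl.
class TrackerSketch {
public:
    // `capacity` is the fixed number of transaction IDs, i.e. [0, capacity).
    explicit TrackerSketch(std::size_t capacity)
        : completed_(capacity), highest_(-1) {
        for (auto& flag : completed_) flag.store(false);
    }

    void transactionCompleted(long id) {
        if (id < 0 || static_cast<std::size_t>(id) >= completed_.size()) return;

        // Step 1: mark the transaction as complete without taking a lock.
        completed_[static_cast<std::size_t>(id)].store(true);

        // Step 2: under the mutex, advance the watermark over every
        // consecutive ID that is already marked complete.
        std::lock_guard<std::mutex> lock(mutex_);
        long next = highest_.load() + 1;
        while (static_cast<std::size_t>(next) < completed_.size() &&
               completed_[static_cast<std::size_t>(next)].load()) {
            ++next;
        }
        highest_.store(next - 1);
    }

    // Hypothetical accessor: highest ID with no gaps below it, or -1.
    long highestGapFree() const { return highest_.load(); }

private:
    std::vector<std::atomic<bool>> completed_;  // one flag per transaction ID
    std::atomic<long> highest_;                 // gap-free watermark
    std::mutex mutex_;                          // serializes watermark updates
};
```

Because the watermark only moves forward, each flag is scanned past at most once, so the work done under the mutex stays small on average even though a single call may advance over many IDs.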

Future Improvements

The current implementation handles concurrency efficiently, but several improvements could be considered:

1. Dynamic Resizing of the Vector:

The current solution assumes a fixed size for the vector (completedTransactions). For real-world applications handling billions of transactions, a dynamic resizing mechanism could be implemented to extend the vector when needed.
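
One possible way to grow the storage is sketched below. A std::vector<std::atomic<bool>> cannot simply be resized, because std::atomic is neither copyable nor movable, so this sketch allocates fixed-size chunks on demand instead. The class ChunkedFlags, its method names, the chunk size of 1024, and the use of std::shared_mutex are assumptions for illustration, not part of the repository.

```cpp
#include <array>
#include <atomic>
#include <cstddef>
#include <memory>
#include <mutex>
#include <shared_mutex>
#include <vector>

// Growable set of completion flags, stored in fixed-size chunks (sketch).
class ChunkedFlags {
public:
    // Marks `id` as completed, allocating chunks up to that ID if needed.
    void set(std::size_t id) {
        const std::size_t chunk = id / kChunkSize;
        {
            // Fast path: the chunk already exists, so a shared lock suffices.
            std::shared_lock<std::shared_mutex> read(mutex_);
            if (chunk < chunks_.size()) {
                (*chunks_[chunk])[id % kChunkSize].store(true);
                return;
            }
        }
        // Slow path: grow under an exclusive lock, then mark the flag.
        std::unique_lock<std::shared_mutex> write(mutex_);
        while (chunks_.size() <= chunk) chunks_.push_back(makeChunk());
        (*chunks_[chunk])[id % kChunkSize].store(true);
    }

    // Returns true if `id` was marked; IDs beyond the allocated range are false.
    bool test(std::size_t id) const {
        std::shared_lock<std::shared_mutex> read(mutex_);
        const std::size_t chunk = id / kChunkSize;
        return chunk < chunks_.size() && (*chunks_[chunk])[id % kChunkSize].load();
    }

    std::size_t chunkCount() const {
        std::shared_lock<std::shared_mutex> read(mutex_);
        return chunks_.size();
    }

private:
    static constexpr std::size_t kChunkSize = 1024;
    using Chunk = std::array<std::atomic<bool>, kChunkSize>;

    static std::unique_ptr<Chunk> makeChunk() {
        auto chunk = std::make_unique<Chunk>();
        for (auto& flag : *chunk) flag.store(false);  // explicit initial state
        return chunk;
    }

    std::vector<std::unique_ptr<Chunk>> chunks_;
    mutable std::shared_mutex mutex_;
};
```

Growing the outer vector only moves the unique_ptr handles, so existing flags keep their addresses. A production version would also need to release chunks that lie entirely below the gap-free ID, which this sketch does not attempt.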

2. Lock-Free Updates to Highest Gap-Free Transaction:

The use of a mutex ensures thread safety but introduces some contention. A future improvement could involve exploring lock-free algorithms (e.g., a Compare-And-Swap approach) to update the highest gap-free transaction without the need for a mutex, further improving concurrency performance.
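
A Compare-And-Swap variant might look like the sketch below. It is an untested-in-production illustration, not the repository's code: the class name LockFreeTrackerSketch and the accessor highestGapFree() are hypothetical, and the sketch relies on the default sequentially consistent memory ordering rather than tuned orderings.

```cpp
#include <atomic>
#include <cstddef>
#include <vector>

// Mutex-free variant of the tracker (illustrative sketch).
class LockFreeTrackerSketch {
public:
    explicit LockFreeTrackerSketch(std::size_t capacity)
        : completed_(capacity), highest_(-1) {
        for (auto& flag : completed_) flag.store(false);
    }

    void transactionCompleted(long id) {
        if (id < 0 || static_cast<std::size_t>(id) >= completed_.size()) return;
        completed_[static_cast<std::size_t>(id)].store(true);

        // Try to move the watermark forward one ID at a time. If another
        // thread wins the CAS, `current` is refreshed with the value it
        // installed and the loop re-checks from there.
        long current = highest_.load();
        while (isCompleted(current + 1)) {
            if (highest_.compare_exchange_weak(current, current + 1)) {
                ++current;
            }
        }
    }

    // Hypothetical accessor: highest ID with no gaps below it, or -1.
    long highestGapFree() const { return highest_.load(); }

private:
    bool isCompleted(long id) const {
        return id >= 0 && static_cast<std::size_t>(id) < completed_.size() &&
               completed_[static_cast<std::size_t>(id)].load();
    }

    std::vector<std::atomic<bool>> completed_;
    std::atomic<long> highest_;
};
```

The watermark only ever moves from an ID to its successor, and only after the successor's flag has been observed as set, so it can never skip over an incomplete transaction. Whether this outperforms the mutex version under real contention would need to be measured.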

3. Persistent Storage for Crash Recovery:

In a real-world system, ensuring the gap-free tracker’s state is persisted would be essential for crash recovery. Implementing a method to persist the tracker state to disk could be a worthwhile future enhancement.
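
As a rough starting point, the sketch below persists only the highest gap-free ID. The function names saveWatermark and loadWatermark, the text file format, and the ".tmp" suffix are hypothetical. It assumes a POSIX-like file system where renaming over an existing file replaces it atomically, and it does not force the data to stable storage, so it is not a complete durability solution.

```cpp
#include <cstdio>
#include <fstream>
#include <string>

// Writes the watermark to a temporary file, then renames it over `path`
// so that readers never observe a partially written file.
bool saveWatermark(const std::string& path, long watermark) {
    const std::string tmp = path + ".tmp";
    {
        std::ofstream out(tmp, std::ios::trunc);
        if (!out) return false;
        out << watermark << '\n';
        out.flush();
        if (!out) return false;
    }  // file is closed here, before the rename
    return std::rename(tmp.c_str(), path.c_str()) == 0;
}

// Returns the stored watermark, or -1 if the file is missing or unreadable.
long loadWatermark(const std::string& path) {
    std::ifstream in(path);
    long watermark = -1;
    if (!(in >> watermark)) return -1;
    return watermark;
}
```

After a restart, completions above the restored watermark would be unknown, so the surrounding system would have to report or replay them again. That recovery protocol is outside the scope of this sketch.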

Installation and Testing

Prerequisites

To compile and run this project, ensure that you have the following dependencies installed:

C++17 or higher (C++20 recommended)
CMake (Version 3.29 or higher)
Google Test (Automatically fetched by CMake)

Steps to Build and Run Tests

Clone the repository:
```
git clone <repository-url>
cd <repository-directory>
```

Configure the project with CMake: in the project directory, create a build folder and run CMake from inside it:
```
mkdir build
cd build
cmake ..
```
Build the project:
```
make
```
Run the tests: After building, run the tests using CTest (CMake's testing tool):
```
ctest
```
Alternatively, you can run the tests directly by executing the test binary:
```
./neo4j_gtest
```

Performance Test (500 Threads)

We have included a performance test to simulate concurrent transaction completions using 500 threads. This test validates the thread safety and performance of the GapFreeTrackerImpl class. It can be executed as part of the test suite.
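
The general shape of such a test can be sketched without Google Test, as below. This is not the repository's test: StandInTracker is a simplified stand-in so the example compiles on its own, and runConcurrentCompletions, its parameters, and the interleaved ID assignment are assumptions chosen for illustration.

```cpp
#include <atomic>
#include <cstddef>
#include <mutex>
#include <thread>
#include <vector>

// Minimal stand-in tracker so the example is self-contained (sketch only).
class StandInTracker {
public:
    explicit StandInTracker(std::size_t capacity)
        : completed_(capacity), highest_(-1) {
        for (auto& flag : completed_) flag.store(false);
    }

    void transactionCompleted(long id) {
        completed_[static_cast<std::size_t>(id)].store(true);
        std::lock_guard<std::mutex> lock(mutex_);
        long next = highest_.load() + 1;
        while (static_cast<std::size_t>(next) < completed_.size() &&
               completed_[static_cast<std::size_t>(next)].load()) {
            ++next;
        }
        highest_.store(next - 1);
    }

    long highestGapFree() const { return highest_.load(); }

private:
    std::vector<std::atomic<bool>> completed_;
    std::atomic<long> highest_;
    std::mutex mutex_;
};

// Completes IDs [0, threadCount * perThread) from `threadCount` threads.
// Thread t handles t, t + threadCount, t + 2 * threadCount, ... so that
// completions from different threads interleave and arrive out of order.
// Returns the final gap-free watermark.
long runConcurrentCompletions(int threadCount, long perThread) {
    const long total = static_cast<long>(threadCount) * perThread;
    StandInTracker tracker(static_cast<std::size_t>(total));

    std::vector<std::thread> workers;
    workers.reserve(static_cast<std::size_t>(threadCount));
    for (int t = 0; t < threadCount; ++t) {
        workers.emplace_back([&tracker, t, threadCount, total] {
            for (long id = t; id < total; id += threadCount) {
                tracker.transactionCompleted(id);
            }
        });
    }
    for (auto& worker : workers) worker.join();
    return tracker.highestGapFree();
}
```

Once every thread has joined, all IDs are complete, so the returned value should equal the total number of transactions minus one. Calling runConcurrentCompletions(500, 1000) would mirror the 500-thread scenario described above.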

Alternative Approaches

1. Lock-Free Data Structures:

A more advanced solution could involve lock-free queues or other lock-free data structures to reduce mutex contention and further improve throughput.

2. Hierarchical Locking:

Instead of a single global lock, a more sophisticated approach could involve hierarchical locking or dividing the transaction range into smaller buckets, each with its own lock. This would reduce contention and improve scalability in systems with a massive number of transactions.
