A Python package that replicates C++ STL-style data structures using the Facade Design Pattern. PythonSTL provides clean, familiar interfaces for developers coming from C++ while maintaining Pythonic best practices.
- C++ STL Compliance: Exact method names and semantics matching C++ STL
- Facade Design Pattern: Clean separation between interface and implementation
- Iterator Support: STL-style iterators (begin, end, rbegin, rend) and Python iteration
- Python Integration: Magic methods (len, bool, contains, repr, eq)
- Type Safety: Full type hints throughout the codebase
- Copy Operations: Deep copy support with copy(), copy(), and deepcopy()
- Comprehensive Documentation: Detailed docstrings with time complexity annotations
- Production Quality: Proper error handling, PEP8 compliance, and extensive testing
- Zero Dependencies: Core package has no external dependencies
pip install pythonstlOr install from source:
git clone https://github.com/AnshMNSoni/PythonSTL.git
cd PythonSTL
pip install -e .from pythonstl import stack, queue, vector, stl_set, stl_map, priority_queue
# Stack (LIFO) - Now with Python magic methods!
s = stack()
s.push(10)
s.push(20)
print(s.top()) # 20
print(len(s)) # 2 - Python len() support
print(bool(s)) # True - Python bool() support
# Vector (Dynamic Array) - With iterators!
v = vector()
v.push_back(100)
v.push_back(200)
v.push_back(300)
v.reserve(1000) # Pre-allocate capacity
print(len(v)) # 3
print(200 in v) # True - Python 'in' operator
# Iterate using STL-style iterators
for elem in v.begin():
print(elem)
# Or use Python iteration
for elem in v:
print(elem)
# Set (Unique Elements) - With magic methods
s = stl_set()
s.insert(5)
s.insert(10)
print(5 in s) # True
print(len(s)) # 2
# Map (Key-Value Pairs) - With iteration
m = stl_map()
m.insert("key1", 100)
m.insert("key2", 200)
print("key1" in m) # True
for key, value in m:
print(f"{key}: {value}")
# Priority Queue - With comparator support
pq_max = priority_queue(comparator="max") # Max-heap (default)
pq_min = priority_queue(comparator="min") # Min-heap
pq_max.push(30)
pq_max.push(10)
pq_max.push(20)
print(pq_max.top()) # 30LIFO (Last-In-First-Out) container adapter.
Methods:
push(value)- Add element to toppop()- Remove top elementtop()- Access top elementempty()- Check if emptysize()- Get number of elementscopy()- Create deep copy
Python Integration:
len(s)- Get sizebool(s)- Check if non-emptyrepr(s)- String representations1 == s2- Equality comparison
FIFO (First-In-First-Out) container adapter.
Methods:
push(value)- Add element to backpop()- Remove front elementfront()- Access front elementback()- Access back elementempty()- Check if emptysize()- Get number of elementscopy()- Create deep copy
Python Integration:
len(q)- Get sizebool(q)- Check if non-emptyrepr(q)- String representationq1 == q2- Equality comparison
Dynamic array with capacity management.
Methods:
push_back(value)- Add element to endpop_back()- Remove last elementat(index)- Access element with bounds checkinginsert(position, value)- Insert element at positionerase(position)- Remove element at positionclear()- Remove all elementsreserve(capacity)- Pre-allocate capacityshrink_to_fit()- Reduce capacity to sizesize()- Get number of elementscapacity()- Get current capacityempty()- Check if emptybegin()- Get forward iteratorend()- Get end iteratorrbegin()- Get reverse iteratorrend()- Get reverse end iteratorcopy()- Create deep copy
Python Integration:
len(v)- Get sizebool(v)- Check if non-emptyvalue in v- Check if value existsrepr(v)- String representationv1 == v2- Equality comparisonv1 < v2- Lexicographic comparisonfor elem in v- Python iteration
Associative container storing unique elements.
Methods:
insert(value)- Add elementerase(value)- Remove elementfind(value)- Check if element existsempty()- Check if emptysize()- Get number of elementsbegin()- Get iteratorend()- Get end iteratorcopy()- Create deep copy
Python Integration:
len(s)- Get sizebool(s)- Check if non-emptyvalue in s- Check if value existsrepr(s)- String representations1 == s2- Equality comparisonfor elem in s- Python iteration
Associative container storing key-value pairs.
Methods:
insert(key, value)- Add or update key-value pairerase(key)- Remove key-value pairfind(key)- Check if key existsat(key)- Access value by keyempty()- Check if emptysize()- Get number of pairsbegin()- Get iteratorend()- Get end iteratorcopy()- Create deep copy
Python Integration:
len(m)- Get sizebool(m)- Check if non-emptykey in m- Check if key existsrepr(m)- String representationm1 == m2- Equality comparisonfor key, value in m- Python iteration
Container adapter providing priority-based access.
Methods:
push(value)- Insert elementpop()- Remove top elementtop()- Access top elementempty()- Check if emptysize()- Get number of elementscopy()- Create deep copy
Comparator Support:
priority_queue(comparator="max")- Max-heap (default)priority_queue(comparator="min")- Min-heap
Python Integration:
len(pq)- Get sizebool(pq)- Check if non-emptyrepr(pq)- String representationpq1 == pq2- Equality comparison
| Container | Operation | Complexity |
|---|---|---|
| Stack | push() | O(1) amortized |
| pop() | O(1) | |
| top() | O(1) | |
| Queue | push() | O(1) |
| pop() | O(1) | |
| front() / back() | O(1) | |
| Vector | push_back() | O(1) amortized |
| pop_back() | O(1) | |
| at() | O(1) | |
| insert() | O(n) | |
| erase() | O(n) | |
| reserve() | O(1) | |
| shrink_to_fit() | O(1) | |
| Set | insert() | O(1) average |
| erase() | O(1) average | |
| find() | O(1) average | |
| Map | insert() | O(1) average |
| erase() | O(1) average | |
| find() | O(1) average | |
| at() | O(1) average | |
| Priority Queue | push() | O(log n) |
| pop() | O(log n) | |
| top() | O(1) |
PythonSTL follows the Facade Design Pattern with three layers:
-
Core Layer (
pythonstl/core/)- Base classes and type definitions
- Custom exceptions
- Iterator classes
-
Implementation Layer (
pythonstl/implementations/)- Private implementation classes (prefixed with
_) - Efficient use of Python built-ins
- Not intended for direct user access
- Private implementation classes (prefixed with
-
Facade Layer (
pythonstl/facade/)- Public-facing classes
- Clean, STL-compliant API
- Delegates to implementation layer
This architecture ensures:
- Encapsulation: Internal implementation is hidden
- Maintainability: Easy to modify internals without breaking API
- Testability: Each layer can be tested independently
Important: PythonSTL containers are NOT thread-safe by default. If you need to use them in a multi-threaded environment, you must provide your own synchronization (e.g., using threading.Lock).
import threading
from pythonstl import stack
s = stack()
lock = threading.Lock()
def thread_safe_push(value):
with lock:
s.push(value)- Clean API: Users interact with simple, well-defined interfaces
- Flexibility: Internal implementation can change without affecting users
- Type Safety: Facade layer enforces type contracts
- Error Handling: Consistent error messages across all containers
- Familiarity: C++ developers can use PythonSTL immediately
- Consistency: Predictable method names across containers
- Documentation: Extensive C++ STL documentation applies
Full Python integration while maintaining STL compatibility:
- Magic methods for natural Python usage
- Iterator protocol support
- Copy protocol support
- Maintains backward compatibility
PythonSTL includes a compiled Rust backend (built with PyO3 and Maturin) for high-performance operations, alongside pure-Python fallbacks.
When comparing PySTL to Python's built-ins, it is crucial to recognize two key system characteristics:
-
Sorted Tree-based vs. Hash-based Complexity: Python's native
dictandsetare unordered hash tables with average$O(1)$ lookup/insert complexity. PySTL'sstl_mapandsetare modeled after C++'sstd::map/std::set(usingBTreeMap/BTreeSetin Rust and anAVLTreein Python) which maintain keys in sorted order, yielding$O(\log N)$ lookup/insert complexity. Direct speed comparison between them is algorithmically an "apples-to-oranges" comparison. - FFI Boundary Crossing Overhead: For high-frequency, low-work operations (like pushing single elements), the cost of crossing the Python-Rust FFI boundary is the dominant overhead factor.
To isolate the actual performance gains of using the Rust backend, PySTL benchmarks compare the Rust-backed STL containers against their Pure Python STL container counterparts (equivalent data structures and APIs), alongside Python's built-ins as a baseline.
| Container Class & Operation | Pure Python STL | Python + Rust STL | Native Built-in | Rust Speedup (vs Pure Py STL) |
|---|---|---|---|---|
| Stack (1,000,000 push/pops) | 0.4768s | 0.3227s | 0.0530s (Native list.pop) | 1.48x faster |
| Vector (10,000 push_backs) | 0.2296s | 0.1374s | 0.0444s (Native list.append) | 1.67x faster |
| Vector (10,000 random at()) | 0.4844s | 0.3264s | 0.0586s (Native list[i]) | 1.48x faster |
| Map (10,000 insert - integers) | 0.0873s | 0.0116s | 0.0019s (Native dict[key]) | 7.53x faster |
| Map (10,000 find - integers) | 0.0077s | 0.0046s | 0.0018s (Native key in dict) | 1.68x faster |
Note: For primitive key types (like integers, floats, and strings), the Rust BTreeMap/BTreeSet uses native type-extraction fast-paths in PyObjectOrd::cmp to avoid calling back into CPython's rich comparison system.
| Algorithm Name | Pure Python (Middle Pivot) | Python + Rust | Pure C++ (O3) | Rust Speedup | Design & FFI Insights |
|---|---|---|---|---|---|
| next_permutation | 0.2413s | 0.1428s | 0.0000s | 1.7x | Lexicographical rearrangement. Limited by FFI conversions. |
| nth_element | 0.0064s | 0.0052s | 0.0000s | 1.2x | Quickselect median find. (Previously 70.85s before optimization). |
| partition | 0.0227s | 0.0164s | 0.0003s | 1.4x | Lambda-predicate partitioning. Dominated by FFI callback overhead. |
-
Algorithmic Pivot Vulnerabilities: A naive Lomuto partition (
pivot = arr[right]) causes$O(N^2)$ worst-case time on already-sorted or reversed lists (taking 70.85s). By switching PythonSTL to a middle-pivot (arr[mid]), we restore$O(N)$ average time (0.0064s).
| Search Mode / Comparator | Pure Python | Python + Rust | Pure C++ (O3) | Rust Speedup | Systems & Design Insights |
|---|---|---|---|---|---|
Standard (< comparison) |
0.0182s | 0.0034s | 0.0000s | 5.3x | Preserves |
| Custom Comparator (lambda) | 0.0180s | 0.0078s | N/A | 2.3x | Overcomes Python loop overhead despite FFI callbacks. |
-
Direct Indexing: Instead of extracting/copying the entire list (an
$O(N)$ operation), the Rust backend uses direct GIL-bound indexing (arr.get_item(mid)), maintaining the strict$O(\log N)$ search complexity.
While PythonSTL provides powerful optimizations, it is essential to understand the systems-level design characteristics and FFI boundaries when using this library:
-
FFI (Foreign Function Interface) Overhead on Granular Operations:
- For container methods like
stack.push()/pop()orqueue.push()/pop(), the Rust backend is only marginally faster (around 1.3x - 1.5x) than pure Python. - Why: Calling a compiled function from Python space millions of times introduces constant-factor FFI boundary crossing overhead (argument validation, GIL checks, etc.) that dominates the execution time.
-
Guidance: The Rust backend excels at computation-intensive algorithms (like
bubble_sortrunning 190x+ faster orbinary_searchrunning 5x+ faster) that cross the FFI boundary only once.
- For container methods like
-
Sorted Trees vs. Hash Tables (Set & Map):
- Rust-backed
stl_setandstl_mapare slower than Python's nativesetanddict. -
Why: Python's native sets/dicts are highly optimized unordered hash tables (
$O(1)$ average complexity). Replicating C++ STL associative compliance requires sorted trees (Rust'sBTreeSetandBTreeMap), which run in$O(\log N)$ time and execute callbacks into the Python VM for custom comparisons. -
Guidance: Use
stl_set/stl_mapwhen you explicitly require keys to be maintained in sorted order (e.g., for range/boundary lookups) matching standard C++ semantics, rather than simple hash-lookup speed.
- Rust-backed
Run the test suite:
# Install test dependencies
pip install pytest pytest-cov
# Run tests
pytest tests/
# Run with coverage
pytest tests/ --cov=pythonstl --cov-report=htmlgit clone https://github.com/AnshMNSoni/PythonSTL.git
cd PythonSTL
pip install -e ".[dev]"# Type checking
mypy pythonstl/
# Linting
flake8 pythonstl/
# Run all checks
pytest && mypy pythonstl/ && flake8 pythonstl/Myth 1: "This library has no actual use because online platforms (LeetCode, Codeforces) don't support external libraries."
- Reality: Correct! You cannot import external packages during live programming contests. The goal of
PythonSTLis not to be used in live contests. Instead, it serves as a local prototyping, learning, and transition tool. C++ developers moving to Python can use it locally to adapt their mental model of STL data structures to Python's syntax, and to practice structure design during mock interviews.
Myth 2: "Python's native structures are better and faster, so the Rust backend is unnecessary over-engineering."
-
Reality: While Python's built-ins are great, Python actually lacks native equivalents for some core STL behaviors:
-
No Sorted Set/Map: Python's built-in
setanddictare hash-based and unordered. To maintain a sorted collection, you'd have to sort repeatedly$\mathcal{O}(n \log n)$ .PythonSTL's Rust backend provides a true$\mathcal{O}(\log n)$ sortedBTreeSetandBTreeMap(and the pure-Python fallback uses a balanced AVL Tree). -
No Customizable Priority Queue: Python’s
heapqis strictly a min-heap, and custom comparators are difficult to write.PythonSTLprovides max/min heaps and custom sorting keys out-of-the-box. - Engineering Showcase: The Rust backend built via Maturin and PyO3 demonstrates a hybrid performance architecture. In real-world projects (like Polars, Pydantic, or cryptography libraries), performance-critical loops are written in compiled languages and bound to Python. This library serves as an educational blueprint for that pattern.
-
No Sorted Set/Map: Python's built-in
-
Reality: Incorrect. As detailed in the performance benchmarks, granular
$O(1)$ operations like single element pushes/pops onstackorqueueare dominated by FFI (Foreign Function Interface) boundary crossing overhead. The Rust backend excels in computation-intensive algorithms (like sorting, partitioning, or binary searching large arrays) where the FFI boundary is crossed only once or twice, and when type-extraction fast-paths can stay natively in Rust.
- Reality: Absolutely not. Even with the Rust backend, PySTL containers are not thread-safe. Since they store Python objects (
PyObject), Rust has to interact with the Python GIL (Global Interpreter Lock). Simultaneous mutations from multiple Python threads on the same container will lead to data races or undefined behavior unless synchronized using Python'sthreading.Lock.
-
Reality: No. They serve fundamentally different algorithmic needs. Python
set/dictare hash tables ($O(1)$ average complexity, unordered). PySTL's set/map are tree-based sorted containers ($O(\log N)$ complexity, ordered). They should only be used when keys must be kept sorted or when range query capabilities (likelower_bound/upper_bound) are needed.
- Reality: False. Because PySTL containers store arbitrary Python objects, they hold
PyObjectreferences. They participate in CPython's reference counting and garbage collection. If you create circular references, CPython's GC still has to clean them up.
MIT License - see LICENSE file for details.
Contributions are welcome! Please:
- Fork the repository
- Create a feature branch
- Add tests for new features
- Ensure all tests pass
- Submit a pull request
- GitHub: @AnshMNSoni
- Issues: GitHub Issues
- Linkedin: @anshmnsoni
PythonSTL v1.1.9 - Bringing C++ STL elegance to Python
