Getting Started¶

1. Installation¶

polars-expr-hopper is on PyPI. To install with standard Polars support:

pip install polars-expr-hopper[polars]

Using uv (optional)

If you prefer to use uv (recommended for a smoother developer experience), you can install with:

uv pip install polars-expr-hopper[polars]

or set up a project (e.g., uv init --app --package, uv venv, then activate the venv), and add polars-expr-hopper:

uv add polars-expr-hopper[polars]

2. Usage¶

polars-expr-hopper provides a Polars plugin that attaches a .hopper namespace to your DataFrame. Within this namespace, you can add, list, and apply Polars expressions (pl.Expr). The plugin automatically preserves your expressions across transforms like df.hopper.select(...) or df.hopper.with_columns(...), applying them as soon as the needed columns appear.

Basic Example¶

import polars as pl
import polars_expr_hopper  # registers .hopper plugin

# Create an initial DataFrame
df = pl.DataFrame({
    "user_id": [1, 2, 3, 0],
    "name": ["Alice", "Bob", "Charlie", "NullUser"],
})

# Add an expression referencing 'user_id'
df.hopper.add_filters(pl.col("user_id") != 0)

# Apply what we can; 'user_id' is present, so the filter applies now
df = df.hopper.apply_ready_filters()
print(df)
# Rows with user_id=0 are dropped.

# Add an expression referencing 'age' (not yet present)
df.hopper.add_filters(pl.col("age") > 18)

# Add the 'age' column
df2 = df.hopper.with_columns(pl.Series("age", [25, 15, 30]))

# Now apply again; only rows with age>18 remain
df2 = df2.hopper.apply_ready_filters()
print(df2)

If you need optional serialization (e.g., to store expressions in Parquet or share them across sessions), see the API reference for methods like serialize_filters() and deserialize_filters().

3. Local Development¶

Clone the Repo:

git clone https://github.com/lmmx/polars-expr-hopper.git

Install Dependencies: - If you’re using pdm:
```
pdm install
```
- Otherwise, standard pip:
```
pip install -e .
```
Optional: Pre-commit Hooks:
```
pre-commit install
```
This runs lint checks (e.g., ruff, black) before each commit.
Run Tests:
```
pytest
```
Build/Serve Docs (if included):
```
mkdocs serve
```
Then visit the local server link. Use mkdocs gh-deploy to publish on GitHub Pages.

4. Example Workflow¶

Create or load a DataFrame with Polars:

df = pl.DataFrame({"col": [1, 2, 3, 4]})

Add expressions referencing columns:

df.hopper.add_filter(pl.col("col") > 2)

Apply:

df2 = df.hopper.apply_ready_filters()
# Rows with col <= 2 are dropped

Add new columns or transformations**:

df3 = df2.hopper.with_columns(pl.Series("extra_col", [10, 20]))
# The plugin's metadata is copied forward

Re-apply if you have pending expressions referencing extra_col:
```
df3 = df3.hopper.apply_ready_filters()
```

5. Configuration¶

polars-expr-hopper primarily relies on Polars’ existing ecosystem. There is no specific environment variable or external config required.

If you plan to serialize expressions for Parquet or across sessions, note that Polars warns it may not be stable across major Polars versions.
If you see any “missing column” errors, remember the plugin only applies expressions once all required columns exist. You can check pending expressions via:
```
df.hopper.list_filters()
```

To learn more, see the API Reference or ask in Issues.