GitHub - acceleratescience/voice: Fine-tuning LLMs for stylistic alignment with GRPO

🗣️VOICE

Research software for fine-tuning large language models to match a target author's writing style, combining a calibrated stylometric evaluation suite with LoRA supervised fine-tuning and GRPO reinforcement learning.


Table of Contents

Overview
Documentation
Installation
Quick Start
Python API
Contributing
License

Overview

VOICE is an NLP research toolkit for stylometric style alignment: fine-tuning a large language model so its outputs are stylistically consistent with a target author. The toolkit provides three integrated components:

Stylometry: a suite of surface-level writing-style metrics (word-length moments, vocabulary richness, function-word frequency, character n-gram diversity) organised into four metric groups.
Evaluation: a calibrated alignment score $\mathcal{S}\in[0, 1]$ comparing model completions to a reference corpus using Wasserstein distance, normalised against within-author variation estimated from the training split via bootstrap resampling. Uncertainty estimates are provided via jackknife resampling.
Fine-tuning: a CLI for running LoRA experiments (single runs or hyperparameter sweeps) via axolotl, with style alignment scoring built in. Both supervised fine-tuning and GRPO are available; the latter uses a custom typicality reward function.
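
To make the Stylometry component concrete, here is a minimal sketch of the kind of surface metrics listed above, computed for a single text. It is an illustration only and not the VOICE implementation: the tokenizer, the short function-word list, and the name `surface_metrics` are all assumptions made for this example.

```python
import re
from statistics import mean, pstdev

# Hypothetical, deliberately tiny function-word list (an assumption for
# this sketch; the toolkit's own list is documented in docs/01_stylometry.md).
FUNCTION_WORDS = frozenset(
    {"the", "a", "an", "of", "to", "in", "and", "or", "but",
     "that", "is", "it", "on", "for", "with", "as"}
)


def tokenize(text: str) -> list[str]:
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def surface_metrics(text: str, n: int = 3) -> dict[str, float]:
    """Compute a few surface style metrics for one text."""
    words = tokenize(text)
    if not words:
        raise ValueError("text contains no word tokens")
    lengths = [len(w) for w in words]
    # Character n-grams are taken over the raw text, spaces included.
    ngrams = [text[i:i + n] for i in range(len(text) - n + 1)]
    return {
        # First two moments of the word-length distribution.
        "mean_word_length": mean(lengths),
        "std_word_length": pstdev(lengths),
        # Vocabulary richness as a type-token ratio.
        "type_token_ratio": len(set(words)) / len(words),
        # Share of tokens that are function words.
        "function_word_ratio": sum(w in FUNCTION_WORDS for w in words) / len(words),
        # Share of character n-grams that are distinct.
        "char_ngram_diversity": len(set(ngrams)) / len(ngrams) if ngrams else 0.0,
    }
```

Applying a function like this to every completion and every reference text yields one distribution per metric for each side, which is the input a distributional comparison needs.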

Figure: Function word ratio distributions for the base model and the VOICE fine-tuned model completions versus the reference corpus, with the Wasserstein distance annotated.
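
The distance annotated in the figure, and the idea of normalising it against within-author variation, can be sketched as follows. This is a simplified illustration under stated assumptions, not the scoring formula used by VOICE (see docs/02_evals.md for that): the function names, the bootstrap scheme, and the use of a tail fraction as the calibrated value are all hypothetical.

```python
import random
from bisect import bisect_right


def wasserstein_1d(u: list[float], v: list[float]) -> float:
    """1-D Wasserstein-1 distance: area between the two empirical CDFs."""
    u, v = sorted(u), sorted(v)
    grid = sorted(u + v)
    total = 0.0
    for left, right in zip(grid, grid[1:]):
        cdf_u = bisect_right(u, left) / len(u)
        cdf_v = bisect_right(v, left) / len(v)
        total += abs(cdf_u - cdf_v) * (right - left)
    return total


def within_author_distances(
    reference: list[float], n_boot: int = 200, seed: int = 0
) -> list[float]:
    """Distances between pairs of bootstrap resamples of the reference values.

    Assumption: this stands in for 'within-author variation'.
    """
    rng = random.Random(seed)
    k = len(reference)
    return [
        wasserstein_1d(rng.choices(reference, k=k), rng.choices(reference, k=k))
        for _ in range(n_boot)
    ]


def calibrated_tail(
    model_values: list[float],
    reference: list[float],
    n_boot: int = 200,
    seed: int = 0,
) -> float:
    """Fraction of within-author distances at least as large as the observed one.

    Values near 1 mean the model is as close to the author as the author is
    to themselves; values near 0 mean the model is clearly distinguishable.
    """
    observed = wasserstein_1d(model_values, reference)
    null = within_author_distances(reference, n_boot, seed)
    return sum(d >= observed for d in null) / len(null)
```

Here `reference` would hold one metric (for example the function word ratio) evaluated on each reference text, and `model_values` the same metric on each model completion.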


Documentation

docs/
├── 00_data.md          - Datasets: format and Hugging Face references
├── 01_stylometry.md    - Stylometric metrics: definitions and catalogue
├── 02_evals.md         - Evaluation suite: scoring methodology and API
├── 03_rl.md            - Reward functions for GRPO training
└── 04_cli.md           - Fine-tuning CLI: single runs and sweeps


Installation

VOICE requires Python 3.12. Training requires Linux, because the pinned axolotl version is Linux-only; the evaluation and stylometry components run on all platforms.

Using uv (recommended):

git clone https://github.com/acceleratescience/voice
cd voice
uv sync
source .venv/bin/activate

Authenticate with Hugging Face before running fine-tuning jobs:

huggingface-cli login


Quick Start

Run a single fine-tuning job:

voice finetune single configs/single/example.yaml

This trains a LoRA adapter on top of Llama-3.1-8B-Instruct and writes per-epoch completions and alignment scores to runs/{run_name}/.

For a hyperparameter sweep:

voice finetune sweep configs/sweep/example.yaml

See docs/04_cli.md for the full CLI reference, config format and output layout.


Python API

The evaluation suite can be used independently of the CLI:

from voice import get_dataset, make_comparison, DatasetSpec
from voice.datasets._schema import Split

ds = get_dataset(
    DatasetSpec(
        repo_id="AccelerateScience/bo-press-conference-qa",
        splits=(Split.TRAIN, Split.VALIDATION, Split.TEST),
    )
)

# completions: list[Example] — model outputs on the same prompts as ds.validation
results = make_comparison(completions, ds)

print(results.score)           # overall alignment score in [0, 1]
print(results.group_tails)     # per-group breakdown
print(results.score_ci())      # 90% jackknife confidence interval

See docs/02_evals.md for the full scoring methodology and available diagnostic fields.
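
For readers unfamiliar with jackknife intervals such as the one returned by `results.score_ci()`, the sketch below shows a generic leave-one-out jackknife for an arbitrary statistic. It is not the VOICE implementation: the function name `jackknife_ci`, the normal approximation used to turn the standard error into an interval, and the choice of which values are resampled are assumptions made for illustration.

```python
from math import sqrt
from statistics import NormalDist, mean
from typing import Callable


def jackknife_ci(
    values: list[float],
    statistic: Callable[[list[float]], float] = mean,
    level: float = 0.90,
) -> tuple[float, float]:
    """Leave-one-out jackknife confidence interval for `statistic(values)`."""
    n = len(values)
    if n < 2:
        raise ValueError("need at least two values for a jackknife estimate")
    estimate = statistic(values)
    # Recompute the statistic n times, each time dropping one observation.
    leave_one_out = [statistic(values[:i] + values[i + 1:]) for i in range(n)]
    loo_mean = mean(leave_one_out)
    # Jackknife standard error of the statistic.
    se = sqrt((n - 1) / n * sum((t - loo_mean) ** 2 for t in leave_one_out))
    # Assumption: symmetric normal-approximation interval around the estimate.
    z = NormalDist().inv_cdf(0.5 + level / 2)
    return estimate - z * se, estimate + z * se
```

For the mean, the jackknife standard error coincides with the usual standard error of the mean, which makes the function easy to sanity-check before applying it to a more complex statistic.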


Contributing

Contributions are welcome. To propose a change:

Fork the repository
Create a feature branch (git checkout -b feature/my-change)
Commit your changes (git commit -m 'Add my change')
Push to the branch (git push origin feature/my-change)
Open a pull request

Please raise an issue first for substantial changes.


License

Distributed under the GNU General Public License. See LICENSE for details.


