Performance regression tests #264

@T1mVo

Description

Add performance regression tests to tytanic to detect unintended slowdowns in Typst compilation and layout.

While tytanic already covers functional correctness well, there is currently no structured way to prevent performance regressions. This feature would make performance a first-class, testable property.

A possible implementation is sketched below, although this approach has a problem with machine-dependent variability.

Possible implementation

  • Introduce a new test kind: performance
  • Run performance tests multiple times to reduce noise (warmup + measured runs)
  • Store reference values in ref.json
    • Required: reference compilation time
    • Optional (future): memory usage, layout time, total time, etc.
  • Compare measured values against the reference using a configurable tolerance
    • Example: fail if the measured time deviates from the reference by more than ±5%
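The measurement and comparison steps above could be sketched in Rust (the language tytanic is written in). `median_runtime` and `check_regression` are hypothetical names for illustration, not existing tytanic APIs:

```rust
use std::time::{Duration, Instant};

/// Run a compile closure several times and return the median duration,
/// discarding `warmup` initial runs to reduce noise.
fn median_runtime<F: FnMut()>(mut compile: F, warmup: usize, runs: usize) -> Duration {
    for _ in 0..warmup {
        compile();
    }
    let mut samples: Vec<Duration> = (0..runs)
        .map(|_| {
            let start = Instant::now();
            compile();
            start.elapsed()
        })
        .collect();
    samples.sort();
    samples[samples.len() / 2]
}

/// Fail when the measured time exceeds the reference by more than
/// `tolerance` (e.g. 0.05 for the ±5% example above).
fn check_regression(reference_ms: u64, measured_ms: u64, tolerance: f64) -> Result<(), String> {
    let change = (measured_ms as f64 - reference_ms as f64) / reference_ms as f64;
    if change > tolerance {
        Err(format!(
            "{reference_ms}ms -> {measured_ms}ms (+{:.0}%)",
            change * 100.0
        ))
    } else {
        Ok(())
    }
}
```

Using the median rather than the mean makes the check less sensitive to a single outlier run, which matters most on noisy CI machines.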

Workflow

Create a new performance test

tt new --performance performance-test

Resulting file structure:

.
└── tests
    └── performance-test
        ├── ref.json
        └── test.typ

Initial ref.json:

{
  "compilation_time_ms": 0
}
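Internally, this file could map onto a small struct. The sketch below serializes by hand to stay dependency-free; a real implementation would more likely use `serde`, and the type name `PerfRef` is made up for this example:

```rust
/// Minimal model of the proposed `ref.json` contents.
struct PerfRef {
    compilation_time_ms: u64,
}

impl PerfRef {
    /// Hand-rolled serialization matching the layout shown above.
    fn to_json(&self) -> String {
        format!(
            "{{\n  \"compilation_time_ms\": {}\n}}",
            self.compilation_time_ms
        )
    }
}
```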

Implement test and record baseline

tt update performance-test

Output:

  Starting 1 test (run ID: e3c1682e-9cd6-4cba-90f3-aaa53089fc85)
      pass [    900ms] performance-test
──────────
   Summary [    900ms] 1/1 tests run: all passed

This updates ref.json to:

{
  "compilation_time_ms": 240
}

Detect a performance regression

After modifying the test to increase compilation time:

tt run performance-test

Output:

  Starting 1 test (run ID: e3c1682e-9cd6-4cba-90f3-aaa53089fc85)
      fail [    940ms] performance-test
            240ms -> 300ms (+25%)
──────────
   Summary [    940ms] 0/1 tests run: 1 failed

Problems & considerations

  • Machine-dependent variability
    Compilation time varies across machines and environments, so performance tests are only meaningful when run under comparable conditions.
    • Possible mitigations:
      • Store an identifier in ref.json for each machine that ran the test, each with its own reference compilation time.
      • Run these tests only on one dedicated machine.
      • Compare against a compilation of a previous commit. This requires a VCS, though.
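The first mitigation could look like the following sketch: baselines keyed by a machine identifier, so each environment only compares against itself. The type and field names are hypothetical, and how the identifier is derived (hostname, CI runner label, etc.) is left open:

```rust
use std::collections::HashMap;

/// Per-machine references, as a possible shape for a future ref.json.
struct PerfRefs {
    /// machine identifier -> reference compilation time in ms
    compilation_time_ms: HashMap<String, u64>,
}

impl PerfRefs {
    /// Look up the baseline for this machine. `None` means this machine
    /// has no reference yet and should record one instead of comparing.
    fn reference_for(&self, machine_id: &str) -> Option<u64> {
        self.compilation_time_ms.get(machine_id).copied()
    }

    /// Record (or overwrite) the baseline, e.g. after `tt update`.
    fn record(&mut self, machine_id: &str, ms: u64) {
        self.compilation_time_ms.insert(machine_id.to_string(), ms);
    }
}
```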

Metadata

Labels

  • A-fs (Area: file structure, asset loading and storing)
  • A-tests (Area: anything actually test related)
  • A-vcs (Area: version control system integration)
  • P-medium (Priority: medium)
  • S-needs-investigation (Status: further information is required)
