Security Policy

Overview

FuzzyScorer is a .NET library designed with security-first principles. This document outlines the security measures, threat model, and best practices for using FuzzyScorer safely in production environments.

Security Features

Input Validation & Limits

All input text is validated to prevent denial-of-service attacks:

Limit	Value	Rationale
MaxWordsPerText	10,000	Prevents memory exhaustion from oversized inputs
MaxWordLength	256 characters	Limits processing overhead per word
MaxSimilarityThreshold	50	Prevents quadratic complexity in grouping algorithm

Behavior: Input exceeding limits raises ArgumentException with descriptive message.

// ✅ Valid
Scorer.GetScoringWords("small text");                    // OK
Scorer.GetScoringWords(text, similarity: 10);            // OK (≤ 50)

// ❌ Invalid
Scorer.GetScoringWords(hugeText);                        // ArgumentException: exceeds 10,000 words
Scorer.GetScoringWords(text, similarity: 100);           // ArgumentException: must be ≤ 50

Input Normalization

Before processing, input is normalized to remove attack vectors:

Character Filtering: Non-alphanumeric characters removed (except spaces, hyphens)
- Prevents injection of invisible/control characters
- Example: "hello\x00world" → "hello world"
Whitespace Handling: Normalizes \t, \n, \r to space
- Cross-platform consistency (Windows, Linux, macOS)
Word Extraction: Individual words validated for length and content
- Empty words discarded
- Words > 256 characters rejected

Immutable Objects

WordScore objects are read-only after construction:

var score = new WordScore("hello", 5);

// ❌ NOT POSSIBLE (compiler error)
// score.Text = "goodbye";
// score.Score = 10;

// ✅ CORRECT (read-only properties)
Console.WriteLine(score.Text);   // "hello"
Console.WriteLine(score.Score);  // 5

Benefits:

Thread-safe immutability
Prevents accidental data corruption
Predictable behavior in multithreaded contexts

Cancellation Support

All scoring methods accept CancellationToken for controlled resource management:

var cts = new CancellationTokenSource();

// Cancel operation after 5 seconds
cts.CancelAfter(TimeSpan.FromSeconds(5));

try
{
    var results = Scorer.GetScoringWords(largeText, 1, cts.Token);
}
catch (OperationCanceledException)
{
    // Operation was cancelled (prevents indefinite hangs)
}

Use Cases:

Server request timeouts
User-initiated cancellations
Resource management in async pipelines

No Hardcoded Secrets

Security audit confirms:

✅ No API keys, passwords, tokens, or certificates in code
✅ No external service calls (offline analysis only)
✅ No executable generation or dynamic code emission
✅ No P/Invoke or unmanaged code
✅ No BinaryFormatter or unsafe serialization

Threat Model

Protected Against

Threat	Mitigation
DoS via Large Input	Word/character count limits + `MaxWordsPerText`
DoS via Complexity	Levenshtein distance capped + similarity threshold limit
Memory Exhaustion	Input size validation before processing
Infinite Loops	`CancellationToken` support + no dynamic recursion
Malicious Characters	Input normalization (non-alphanumeric removal)
Object Mutation	Immutable `WordScore` design
Supply Chain	No external dependencies (only .NET runtime)

Not Protected Against

Threat	Reason
Zero-Day Runtime Exploits	Depends on .NET runtime security
Side-Channel Attacks	Timing analysis not mitigated
Semantic Analysis Tricks	Library performs lexical, not semantic analysis
Massive Sequential Requests	Rate limiting should be handled by calling code

Dependency Management

Direct Dependencies

ModelContextProtocol.NET.Server (v0.3.3-alpha)
- Used for: Protocol server communication
- Status: Pre-release (alpha) — review for production use
- Recommendation: Track updates; consider pinning version until stable release

Indirect Dependencies

Run vulnerability scan regularly:

dotnet list package --vulnerable

Transitive Dependencies

No automatic detection in NuGet; recommend SBOM tools:

Dependabot (GitHub): Automated dependency scanning
Snyk: Software composition analysis
CycloneDX: SBOM generation

Usage Guidelines

✅ Safe Usage

// 1. With default limits (best for web services)
try
{
    var results = Scorer.GetScoringWords(userInput);
}
catch (ArgumentException ex)
{
    // Log and return error to user
    _logger.LogWarning($"Invalid input: {ex.Message}");
    return BadRequest(ex.Message);
}

// 2. With cancellation (async contexts)
var cts = new CancellationTokenSource(TimeSpan.FromSeconds(10));
try
{
    var results = Scorer.GetScoringWords(textData, 1, cts.Token);
}
catch (OperationCanceledException)
{
    _logger.LogWarning("Scoring operation timed out");
}

// 3. Read-only access (guaranteed safety)
WordScore score = new WordScore("word", 5);
// score.Text and score.Score are read-only — no mutations possible

❌ Unsafe Usage

// DON'T: Ignore validation exceptions
var results = Scorer.GetScoringWords(untrustedData); // Can throw!

// DON'T: Pass unbounded similarity threshold
var results = Scorer.GetScoringWords(text, 10_000); // Use ≤ 50

// DON'T: No cancellation support in long operations
var results = Scorer.GetScoringWords(hugeFile); // Can hang indefinitely

Security Audit Checklist

Use this checklist for code reviews and security assessments:

Reporting Security Vulnerabilities

If you discover a security vulnerability, please do NOT open a public GitHub issue.

Instead:

Email relevant security contact with:
- Vulnerability description
- Severity assessment (Critical/High/Medium/Low)
- Proof-of-concept (if possible)
- Steps to reproduce
Allow 72 hours for initial assessment
Coordinate disclosure timeline (typically 30–90 days)

Future Improvements

Planned security enhancements:

Rate Limiting: Built-in token bucket or sliding window rate limiter
Audit Logging: Optional structured logging for security events
Metrics: Usage metrics (requests/second, average processing time)
Fuzzing: Continuous fuzzing test suite
SBOM: Generate CycloneDX bill of materials on release

References

Last Updated: 2026-02-17
Version: 1.1 (Security Hardening Release)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Security

SECURITY.md

Security Policy

Overview

Security Features

Input Validation & Limits

Input Normalization

Immutable Objects

Cancellation Support

No Hardcoded Secrets

Threat Model

Protected Against

Not Protected Against

Dependency Management

Direct Dependencies

Indirect Dependencies

Transitive Dependencies

Usage Guidelines

✅ Safe Usage

❌ Unsafe Usage

Security Audit Checklist

Reporting Security Vulnerabilities

Future Improvements

References

There aren’t any published security advisories

Security: lukaszow/FuzzyScorer

Security

SECURITY.md

Security Policy

Overview

Security Features

Input Validation & Limits

Input Normalization

Immutable Objects

Cancellation Support

No Hardcoded Secrets

Threat Model

Protected Against

Not Protected Against

Dependency Management

Direct Dependencies

Indirect Dependencies

Transitive Dependencies

Usage Guidelines

✅ Safe Usage

❌ Unsafe Usage

Security Audit Checklist

Reporting Security Vulnerabilities

Future Improvements

References

There aren’t any published security advisories