NazorKit

NazorKit is a library built on top of MLX-Swift to easily integrate on-device vision language models into your iOS app.

The name "Nazor" is inspired by the Persian word "نظر" ("nazar"), meaning vision, sight, or gaze.

Installation

Swift Package Manager handles the distribution of Swift code and comes built into the Swift compiler.

To add NazorKit to your project, simply include it in your Package.swift file:

dependencies: [
    .package(url: "https://github.com/rryam/NazorKit.git", .upToNextMajor(from: "0.1.0"))
]
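For context, here is a minimal sketch of a complete manifest that uses the dependency entry above. The package and target name `MyApp` is hypothetical, the product name `NazorKit` is an assumption based on the repository name, and the platform list simply mirrors the versions listed under Features, so adjust all three to match your project.

```swift
// swift-tools-version:5.9
import PackageDescription

let package = Package(
    name: "MyApp",  // hypothetical package name
    // Platform versions taken from the Features list in this README
    platforms: [.iOS(.v16), .macOS(.v14), .visionOS(.v1)],
    dependencies: [
        .package(url: "https://github.com/rryam/NazorKit.git", .upToNextMajor(from: "0.1.0"))
    ],
    targets: [
        .target(
            name: "MyApp",  // hypothetical target name
            dependencies: [
                // Assumes the library product is named "NazorKit"
                .product(name: "NazorKit", package: "NazorKit")
            ]
        )
    ]
)
```

Listing the product under the target's dependencies is what makes `import NazorKit` resolve inside that target.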

Or add NazorKit to your project through Xcode's package manager:

In Xcode, go to File > Add Packages...
Enter the package URL: https://github.com/rryam/NazorKit
Select the version or branch you want to use (e.g. main)
Click Add Package

Quick Start

Get up and running with NazorKit in minutes. Here is an example of analyzing an image:

import NazorKit
import SwiftUI

struct ContentView: View {
    @VLMServiceProvider private var vlmService
    @State private var image: UIImage?
    @State private var generatedDescription: String = ""
    
    var body: some View {
        VStack {
            if let image {
                Image(uiImage: image)
                    .resizable()
                    .scaledToFit()
                    .analyzeMedia(
                        service: vlmService,
                        prompt: "Describe this image in detail",
                        image: image
                    ) { description in
                        generatedDescription = description
                    }
                
                Text(generatedDescription)
                    .padding()
            }
        }
    }
}

Features

SwiftUI-first API design
Support for iOS 16.0+, macOS 14.0+, and visionOS 1.0+
Image analysis capabilities
Video analysis support
Built on top of MLX for efficient model inference
Customizable model configurations
Easy-to-use property wrappers and view modifiers

Basic Usage

Here's a simple example of how to analyze an image using NazorKit:

struct ContentView: View {
    @VLMServiceProvider private var vlmService
    @State private var image: UIImage?
    @State private var generatedDescription: String = ""
    
    var body: some View {
        VStack {
            if let image {
                Image(uiImage: image)
                    .resizable()
                    .scaledToFit()
                    .analyzeMedia(
                        service: vlmService,
                        prompt: "Describe this image in detail",
                        image: image
                    ) { description in
                        generatedDescription = description
                    }
                
                Text(generatedDescription)
                    .padding()
            }
        }
    }
}
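In the example above, `image` starts out as `nil` and nothing assigns it. One possible way to populate it on iOS 16+ is the system photo picker from PhotosUI. The sketch below is not part of NazorKit, and the view name `PhotoPickerExample` is hypothetical; it only shows how a `UIImage?` state value could be filled before being passed to `analyzeMedia`.

```swift
import PhotosUI
import SwiftUI

// Hypothetical helper view: lets the user pick a photo and stores it as a UIImage.
struct PhotoPickerExample: View {
    @State private var selection: PhotosPickerItem?
    @State private var image: UIImage?

    var body: some View {
        VStack {
            // System photo picker, restricted to still images
            PhotosPicker("Choose a photo", selection: $selection, matching: .images)

            if let image {
                Image(uiImage: image)
                    .resizable()
                    .scaledToFit()
            }
        }
        // Runs again whenever the picked item changes
        .task(id: selection) {
            guard let selection,
                  let data = try? await selection.loadTransferable(type: Data.self)
            else { return }
            image = UIImage(data: data)
        }
    }
}
```

Once `image` is set, the `if let image` branch renders, which is the point where the `analyzeMedia` modifier from the example above would be attached.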

Advanced Configuration

You can customize the VLM service with specific model configurations:

@VLMServiceProvider(
    configuration: .qwen2VL2BInstruct4Bit,
    generateParameters: .init(temperature: 0.8),
    maxTokens: 1000
) private var vlmService

Custom Generation Parameters

You can fine-tune the generation process with custom parameters:

let generateParameters = GenerateParameters(
    temperature: 0.8,  // Controls randomness (0.0-1.0)
    topP: 0.9          // Nucleus sampling parameter
)

@VLMServiceProvider(
    configuration: .qwen2VL2BInstruct4Bit,
    generateParameters: generateParameters,
    maxTokens: 1000
) private var vlmService
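To give a feel for what these two parameters do, here is a small, self-contained sketch of temperature scaling and nucleus (top-p) filtering on a toy distribution. It is an illustration of the general technique only, not NazorKit's or MLX's actual sampling code; the function names `softmax` and `nucleus` and the example scores are made up for this sketch.

```swift
import Foundation

/// Divides raw scores (logits) by a temperature and converts them to probabilities.
/// Lower temperatures sharpen the distribution; higher temperatures flatten it.
func softmax(_ logits: [Double], temperature: Double) -> [Double] {
    precondition(temperature > 0, "temperature must be positive")
    let scaled = logits.map { $0 / temperature }
    let maxValue = scaled.max() ?? 0          // subtract the max for numerical stability
    let exps = scaled.map { exp($0 - maxValue) }
    let total = exps.reduce(0, +)
    return exps.map { $0 / total }
}

/// Returns the indices kept by nucleus (top-p) filtering: the smallest set of
/// most-probable tokens whose cumulative probability reaches `topP`.
func nucleus(_ probabilities: [Double], topP: Double) -> [Int] {
    let ranked = probabilities.indices.sorted { probabilities[$0] > probabilities[$1] }
    var kept: [Int] = []
    var cumulative = 0.0
    for index in ranked {
        kept.append(index)
        cumulative += probabilities[index]
        if cumulative >= topP { break }
    }
    return kept
}

let logits = [2.0, 1.0, 0.5, -1.0]            // toy scores for four candidate tokens
let sharp = softmax(logits, temperature: 0.5)
let flat = softmax(logits, temperature: 1.5)
let kept = nucleus(softmax(logits, temperature: 0.8), topP: 0.9)
print(sharp, flat, kept)
```

With these toy scores, the lower temperature puts more probability on the top-ranked token than the higher one does, and a `topP` of 0.9 at temperature 0.8 keeps the three most likely tokens and drops the least likely one.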

Video Analysis

NazorKit also supports video analysis:

import AVKit

struct VideoAnalysisView: View {
    @VLMServiceProvider private var vlmService
    @State private var analysis: String = ""
    let videoURL: URL
    
    var body: some View {
        VStack {
            VideoPlayer(player: AVPlayer(url: videoURL))
                .frame(height: 300)
                .analyzeMedia(
                    service: vlmService,
                    prompt: "What's happening in this video?",
                    video: videoURL
                ) { description in
                    analysis = description
                }
            
            Text(analysis)
                .padding()
        }
    }
}

Contributing

I welcome contributions to NazorKit! Here is how you can help:

Fork the repository and create a feature branch
Make your changes following the existing code style
Add tests for new functionality
Update documentation as needed
Submit a pull request with a clear description

Development Setup

Clone the repository
Open Package.swift in Xcode, or open the package folder in VS Code (or one of its forks) or work from the command line
Run tests to ensure everything works
Make your changes and test them

Code Style

Follow SwiftLint rules (run swiftlint lint)
Use Swift 6.0+ features where appropriate

License

NazorKit is available under the MIT license. See LICENSE for more information.

Support

Acknowledgments

Thanks to the MLX team for their excellent work on MLX and the MLX Swift framework!
