codebert-base — CoreML (.mlpackage)

CoreML conversion of microsoft/codebert-base for native Apple Neural Engine / GPU inference on macOS and iOS.

Files

File	Description
`model.mlpackage/`	CoreML model (FP16, flexible shapes)
`tokenizer.json`	HF fast tokenizer

Details

Architecture: RoBERTa (encoder-only, no token_type_ids)
Precision: FP16 (native ANE precision)
Compute units: .all — CoreML schedules across ANE, GPU, and CPU
Input shapes: batch=1..512, seq_len=1..512 (flexible range)
Embedding dimension: 768

Usage with cai

cai index --embed-backend swift --embed-model "rsvalerio/codebert-base-coreml"

The Swift backend downloads the .mlpackage from this repo, compiles it to .mlmodelc on first run (~30-60s), and caches the compiled model for subsequent runs.

Conversion

Converted using rsvalerio/models CI pipeline with coremltools.

pip install coremltools transformers torch
python convert.py

Downloads last month: 7

Model tree for rsvalerio/codebert-base-coreml

Base model

microsoft/codebert-base

Quantized

(4)

this model