codebert-base - CoreML (.mlpackage)

CoreML conversion of microsoft/codebert-base for native Apple Neural Engine / GPU inference on macOS and iOS.

Files

File              Description
model.mlpackage/  CoreML model (FP16, flexible shapes)
tokenizer.json    Hugging Face fast tokenizer

Details

  • Architecture: RoBERTa (encoder-only, no token_type_ids)
  • Precision: FP16 (native ANE precision)
  • Compute units: .all (CoreML schedules across ANE, GPU, and CPU)
  • Input shapes: batch=1..512, seq_len=1..512 (flexible range)
  • Embedding dimension: 768
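
The shape limits above can be checked before a batch is handed to the model. The following is a minimal sketch, assuming the model takes input_ids and attention_mask inputs (the input names are not confirmed by this card) and that padding uses the RoBERTa-style pad token id 1. pad_batch is a hypothetical helper, not part of this repo.

```python
from typing import List, Tuple

MAX_BATCH = 512    # upper bound of the flexible batch range listed above
MAX_SEQ_LEN = 512  # upper bound of the flexible seq_len range listed above
PAD_ID = 1         # assumption: RoBERTa-style <pad> token id


def pad_batch(token_ids: List[List[int]]) -> Tuple[List[List[int]], List[List[int]]]:
    """Pad token id lists to a common length and build attention masks.

    Raises ValueError if the batch falls outside the flexible shape range.
    """
    if not 1 <= len(token_ids) <= MAX_BATCH:
        raise ValueError(f"batch size {len(token_ids)} outside 1..{MAX_BATCH}")
    longest = max(len(ids) for ids in token_ids)
    if not 1 <= longest <= MAX_SEQ_LEN:
        raise ValueError(f"sequence length {longest} outside 1..{MAX_SEQ_LEN}")

    input_ids, attention_mask = [], []
    for ids in token_ids:
        pad = longest - len(ids)
        # Real tokens get mask 1, padding positions get mask 0.
        input_ids.append(ids + [PAD_ID] * pad)
        attention_mask.append([1] * len(ids) + [0] * pad)
    return input_ids, attention_mask
```

No token_type_ids list is built, since the architecture does not use one.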

Usage with cai

cai index --embed-backend swift --embed-model "rsvalerio/codebert-base-coreml"

The Swift backend downloads the .mlpackage from this repo, compiles it to .mlmodelc on first run (~30-60s), and caches the compiled model for subsequent runs.
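
The compile-once-then-cache behaviour described above can be sketched as follows. This is an illustration only, not cai's actual implementation: the real backend is written in Swift, and the cache layout, the compiled_model_dir helper, and the compile_fn callback are all hypothetical.

```python
import hashlib
from pathlib import Path
from typing import Callable


def compiled_model_dir(cache_root: Path, repo_id: str,
                       compile_fn: Callable[[Path], None]) -> Path:
    """Return the cached compiled-model directory, compiling on first use only."""
    # Key the cache on the repo id so different models do not collide.
    key = hashlib.sha256(repo_id.encode("utf-8")).hexdigest()[:16]
    target = cache_root / f"{key}.mlmodelc"
    if not target.exists():
        cache_root.mkdir(parents=True, exist_ok=True)
        compile_fn(target)  # slow step; runs only when nothing is cached
    return target
```

The second and later calls with the same repo id skip compile_fn entirely, which is why only the first run pays the compile cost.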

Conversion

Converted with coremltools by the rsvalerio/models CI pipeline. To reproduce the conversion locally:

pip install coremltools transformers torch
python convert.py