Unable to load the model with ONNX Runtime Web
Error: Uncaught 4024042896
transformers.js version: 3.3.1
https://cdn.jsdelivr.net/npm/@huggingface/[email protected]/dist/ort-wasm-simd-threaded.jsep.mjs
onnx runtime web version: 1.21.0-dev.20250109-3328eb3bb3
Code:
import * as ort from "onnxruntime-web"; // import required for ort.InferenceSession

const modelUrl = "http://localhost:8111/OuteTTS-0.2-500M/onnx/model.onnx"; // any direct URL to a model.onnx file
const session = await ort.InferenceSession.create(modelUrl);
The error is reproducible when running a Rollup-bundled build that uses the example code from https://huggingface.co/onnx-community/OuteTTS-0.2-500M.
Rollup bundle config from the GitHub repo:
import json from "@rollup/plugin-json";

export default {
  input: "outetts.js/index.js",
  output: {
    file: "dist/bundle.js",
    format: "es",
    sourcemap: true,
  },
  plugins: [json()],
};
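For reference, a bundle from this config would typically be produced with the Rollup CLI (assuming the config above is saved as rollup.config.js):

npx rollup -c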
Hi there! Can you try with v3.3.3? We fixed some bundling issues.
@Xenova
Thanks for the response. I tried importing from the v3.3.3 build:

import { Tensor, AutoTokenizer, AutoModelForCausalLM, PreTrainedModel } from 'https://cdn.jsdelivr.net/npm/@huggingface/[email protected]';

in the resulting bundle.js, but I'm still hitting the same error:
Uncaught 4024588040
The code I tried is similar to the example at:
https://huggingface.co/OuteAI/OuteTTS-0.2-500M
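For context, a minimal sketch of that style of usage (the model ID, prompt, and generation parameters here are assumptions; the actual OuteTTS example also converts the generated audio tokens back to a waveform, which is omitted):

import { AutoTokenizer, AutoModelForCausalLM } from 'https://cdn.jsdelivr.net/npm/@huggingface/[email protected]';

// Load the tokenizer and the full-precision ONNX model.
const tokenizer = await AutoTokenizer.from_pretrained('onnx-community/OuteTTS-0.2-500M');
const model = await AutoModelForCausalLM.from_pretrained('onnx-community/OuteTTS-0.2-500M');

// Tokenize a prompt and run generation.
const inputs = await tokenizer('Hello there!');
const output = await model.generate({ ...inputs, max_new_tokens: 256 });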
That is an out-of-memory error, so I'd recommend loading a smaller (quantized) version of the model:
- CPU: https://huggingface.co/onnx-community/OuteTTS-0.2-500M/blob/main/onnx/model_quantized.onnx
- GPU w/ fp16 support: https://huggingface.co/onnx-community/OuteTTS-0.2-500M/blob/main/onnx/model_q4f16.onnx
- GPU w/o fp16 support: https://huggingface.co/onnx-community/OuteTTS-0.2-500M/blob/main/onnx/model_q4.onnx
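If you're loading through transformers.js rather than ONNX Runtime Web directly, the dtype option is the usual way to pick one of these variants. A sketch, assuming the standard dtype-to-filename mapping (e.g. "q4f16" → model_q4f16.onnx, "q8" → model_quantized.onnx):

import { AutoModelForCausalLM } from 'https://cdn.jsdelivr.net/npm/@huggingface/[email protected]';

// dtype selects the quantized weight file; device: 'webgpu' targets the GPU
// backend (omit it to stay on the default WASM/CPU backend).
const model = await AutoModelForCausalLM.from_pretrained(
  'onnx-community/OuteTTS-0.2-500M',
  { dtype: 'q4f16', device: 'webgpu' },
);

With plain ONNX Runtime Web, the equivalent is to point InferenceSession.create at the direct URL of the quantized file instead of model.onnx.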
