predict-otron-9000

This crate is an extensible axum/tokio server that combines embeddings-engine, inference-engine, and leptos-app.

Notes

  • When server_mode is Standalone (default), the instance contains all components necessary for inference.
  • When server_mode is HighAvailability, inference and embeddings are scaled automatically, and the instance proxies requests to the inference and embeddings services via DNS.
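
The difference between the two modes can be sketched roughly as follows. This is an illustrative sketch only, not the crate's actual code: the `ServerMode` enum, the `Route` type, the `route_for` function, and the host name are all hypothetical names chosen for the example. The only details taken from the notes above are the two mode names and the fact that Standalone is the default.

```rust
/// Hypothetical mirror of the `server_mode` setting; the real type may differ.
#[derive(Debug, Clone, Copy, PartialEq, Eq, Default)]
enum ServerMode {
    /// All components run inside this instance (the documented default).
    #[default]
    Standalone,
    /// Inference and embeddings run as separate services reached via DNS.
    HighAvailability,
}

/// Hypothetical description of where a request ends up being handled.
#[derive(Debug, PartialEq, Eq)]
enum Route {
    /// Handled by a component running in this process.
    InProcess,
    /// Forwarded to an external service at the given base URL.
    Proxy(String),
}

/// Decide how to route a request for one component.
/// `service_host` is an assumed DNS name (with port) for the external service.
fn route_for(mode: ServerMode, service_host: &str) -> Route {
    match mode {
        ServerMode::Standalone => Route::InProcess,
        ServerMode::HighAvailability => Route::Proxy(format!("http://{service_host}")),
    }
}

fn main() {
    // With no explicit configuration, the mode falls back to Standalone.
    let mode = ServerMode::default();
    println!("{:?}", route_for(mode, "inference-service.example.internal:8080"));

    // In HighAvailability mode the same request would be proxied instead.
    let ha = ServerMode::HighAvailability;
    println!("{:?}", route_for(ha, "inference-service.example.internal:8080"));
}
```

In a real deployment the proxy target would come from configuration rather than a hard-coded string; the point here is only that the mode selects between local handling and forwarding.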