mirror of
https://github.com/geoffsee/predict-otron-9001.git
synced 2025-09-08 22:46:44 +00:00
update docs
This commit is contained in:
8
crates/predict-otron-9000/README.md
Normal file
8
crates/predict-otron-9000/README.md
Normal file
@@ -0,0 +1,8 @@
|
||||
# predict-otron-9000
|
||||
|
||||
This is an extensible axum/tokio hybrid combining [embeddings-engine](../embeddings-engine), [inference-engine](../inference-engine), and [leptos-app](../leptos-app).
|
||||
|
||||
|
||||
# Notes
|
||||
- When `server_mode` is Standalone (default), the instance contains all components necessary for inference.
|
||||
- When `server_mode` is HighAvailability, automatic scaling of inference and embeddings; proxies to inference and embeddings services via dns
|
Reference in New Issue
Block a user