Local Model Servers
Integration & Setup Manual
Local Model Servers Integration Guide
Overview
Local model servers such as Ollama, LM Studio, llama.cpp-compatible servers, and other OpenAI-compatible local endpoints are common on privacy-sensitive teams. Cup’n’String discovers the local ports these servers listen on, attributes access to them, and governs which agents may connect.
Support level
Auto-Discovered + Compatibility Adapter
What Cup’n’String detects
- Known local model server processes
- OpenAI-compatible local endpoints
- Common ports and socket patterns
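To make "common ports" concrete, here is a minimal sketch of a port probe. It is not how Cup’n’String performs discovery; it only illustrates the idea. Port 11434 is the Ollama example used in this guide; the LM Studio and llama.cpp entries are commonly seen defaults and are assumptions that may not match your installation. The function names are hypothetical.

```python
import socket

# Assumed candidate ports. 11434 is the Ollama example from this guide;
# the other two are common defaults and may differ on your machine.
CANDIDATE_PORTS = {
    11434: "Ollama",
    1234: "LM Studio",
    8080: "llama.cpp server",
}


def is_port_open(host: str, port: int, timeout: float = 0.5) -> bool:
    """Return True if a TCP connection to host:port succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return False


def find_local_servers(host: str = "127.0.0.1", ports=CANDIDATE_PORTS) -> dict:
    """Map each open candidate port to its assumed server label."""
    return {port: label for port, label in ports.items() if is_port_open(host, port)}


if __name__ == "__main__":
    for port, label in find_local_servers().items():
        print(f"port {port} is open (possibly {label})")
```

An open port only suggests that something is listening there; the label is a guess based on the port number, not a confirmation of which server is running.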
What it governs
- Which IDEs/agents may connect
- Exposure of local endpoints to other processes
- Tunnels and remote access
- Secrets passed to local prompts/tools
Recommended policies
- Restrict local model access to approved agents
- Block remote exposure unless explicitly allowed
- Audit prompt/tool metadata where proxying is enabled
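As an illustration of what "remote exposure" means in practice, the sketch below checks whether a server's bind address is limited to the loopback interface. It assumes the address is given as a `host:port` string; the helper name `is_loopback_bind` is hypothetical and not part of Cup’n’String.

```python
import ipaddress


def is_loopback_bind(bind_address: str) -> bool:
    """Return True if a host:port bind address is limited to loopback."""
    host, _, _port = bind_address.rpartition(":")
    host = host.strip("[]")  # allow bracketed IPv6 such as [::1]:11434
    if host == "localhost":
        return True
    try:
        return ipaddress.ip_address(host).is_loopback
    except ValueError:
        # Unparseable or unknown hosts are treated as exposed, to err on the safe side.
        return False


if __name__ == "__main__":
    for addr in ("127.0.0.1:11434", "0.0.0.0:11434", "[::1]:11434"):
        print(addr, "loopback only" if is_loopback_bind(addr) else "potentially exposed")
```

A server bound to `0.0.0.0` accepts connections on every interface, which is the case the second policy is meant to catch.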
Setup outline
- Ensure the Cup’n’String agent is active.
- Fire up your local model server of choice (e.g. Ollama).
- The compatibility adapter auto-detects the running port (e.g. 11434) and restricts direct socket access from unauthorized processes.
Verification
Try connecting to Ollama from a non-approved terminal agent, and verify that the connection is blocked.
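The check can be scripted. The sketch below attempts a TCP connection and classifies the result. This guide does not state how a blocked connection surfaces (refusal, timeout, or another error), so the sketch treats anything other than "connected" as a candidate block. The function name is hypothetical.

```python
import socket


def connection_outcome(host: str, port: int, timeout: float = 2.0) -> str:
    """Classify a TCP connection attempt as 'connected', 'refused', 'timeout', or 'error'."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return "connected"
    except ConnectionRefusedError:
        return "refused"
    except socket.timeout:
        return "timeout"
    except OSError:
        return "error"


if __name__ == "__main__":
    # 11434 is the example Ollama port from this guide.
    print(connection_outcome("127.0.0.1", 11434))
```

A refused connection also occurs when the server is simply not running, so run the same check from an approved agent first to confirm the server is up.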
Troubleshooting
If the server is not detected, check whether it is running on a custom, non-standard port, and if so configure that port explicitly in the Cup’n’String client settings.
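Before configuring a custom port, it can help to confirm that something OpenAI-compatible is actually answering on it. The sketch below requests `/v1/models`, the model-listing path of the OpenAI API that compatible local servers generally expose, and checks for a JSON object containing a `data` list. The function name and the example port 8081 are assumptions for illustration.

```python
import http.client
import json
import urllib.error
import urllib.request


def looks_openai_compatible(base_url: str, timeout: float = 2.0) -> bool:
    """Return True if GET {base_url}/v1/models returns a JSON object with a 'data' list."""
    url = base_url.rstrip("/") + "/v1/models"
    # Bypass any system proxy settings, since the target is a local endpoint.
    opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))
    try:
        with opener.open(url, timeout=timeout) as resp:
            payload = json.load(resp)
    except (urllib.error.URLError, http.client.HTTPException, OSError, ValueError):
        # Unreachable port, HTTP error status, or a body that is not valid JSON.
        return False
    return isinstance(payload, dict) and isinstance(payload.get("data"), list)


if __name__ == "__main__":
    # 8081 is a hypothetical custom port; substitute the one your server uses.
    print(looks_openai_compatible("http://127.0.0.1:8081"))
```

A `False` result does not rule out a model server on that port; it may require authentication or use a different API shape.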
Known limitations
Visibility into model inference content depends on routing traffic through the managed proxy.
Integration Info
Support Level: Auto-Discovered
Additional Capabilities: Compatibility Adapter
Category: AI Protocols & Gateways
Setup Complexity: Low
Governed Safeguards: Network, Secrets, Audit
Links
Check the global compatibility dashboard to see which categories and runtimes this stack fits into.
Supported Environments Matrix