The .servers accessor provides methods for creating, listing, starting, and stopping servers.
Some methods like .match() and the parameter classes below are backend-specific and may not be available on all backends.
Creating servers
The .create() method creates a new server configuration. By default, it downloads the model file (if not already downloaded) and selects a random, unused port for the server.
If a server with the specified configuration is already running, the existing configuration is returned, and no new server is created.
- Using model string reference
- Using QuantizedFile object
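The create-or-reuse behavior described above can be modeled with a small, self-contained sketch. This is not the SDK's implementation: ToyServer, ToyServers, find_unused_port, and the model reference "example-model/Q4_K_M" are hypothetical stand-ins, and the model download step is left out. It only illustrates picking an unused port and returning the existing configuration when a matching server is already running.

```python
import socket
from dataclasses import dataclass
from typing import List, Optional


def find_unused_port() -> int:
    """Ask the OS for a currently free TCP port by binding to port 0."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        sock.bind(("127.0.0.1", 0))
        return sock.getsockname()[1]


@dataclass
class ToyServer:
    """Hypothetical stand-in for a server configuration."""
    model: str
    port: int
    running: bool = False


class ToyServers:
    """Hypothetical stand-in for the .servers accessor (not the SDK class)."""

    def __init__(self) -> None:
        self._servers: List[ToyServer] = []

    def create(self, model: str, port: Optional[int] = None) -> ToyServer:
        # Reuse a configuration only if a matching server is already running.
        # What happens for a matching but stopped server is an assumption here:
        # this sketch simply creates a new configuration.
        for server in self._servers:
            if server.running and server.model == model and port in (None, server.port):
                return server
        server = ToyServer(model=model, port=port or find_unused_port())
        self._servers.append(server)
        return server


servers = ToyServers()
first = servers.create("example-model/Q4_K_M")   # new configuration, OS-chosen port
first.running = True                             # pretend the server was started
second = servers.create("example-model/Q4_K_M")  # matches a running server
```

Here second is the same object as first, so no new server configuration is created while the original is running.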
Server configuration parameters
The .create() method accepts an extra_options parameter to customize server behavior:
Server configuration options are backend-specific. The parameters below apply to the ai-navigator and anaconda-desktop backends.
Managing servers
New servers are not automatically started when their configuration is created. You can start or stop a server using the following methods:
- Using the server object
- Using the .servers accessor
- Using server as a context manager
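The three styles listed above can be compared in a minimal sketch. The classes below (ManagedServer, ManagedServers) are hypothetical stand-ins rather than the SDK's own classes, and the sketch assumes that the context manager starts the server on entry and stops it on exit; check the behavior of your backend before relying on that.

```python
class ManagedServer:
    """Hypothetical stand-in for a server object."""

    def __init__(self, model: str) -> None:
        self.model = model
        self.running = False

    def start(self) -> None:
        self.running = True

    def stop(self) -> None:
        self.running = False

    def __enter__(self) -> "ManagedServer":
        # Assumption: entering the context starts the server.
        self.start()
        return self

    def __exit__(self, exc_type, exc, tb) -> bool:
        # Assumption: leaving the context stops the server, even on error.
        self.stop()
        return False  # do not swallow exceptions


class ManagedServers:
    """Hypothetical stand-in for the .servers accessor."""

    def create(self, model: str) -> ManagedServer:
        # Creating a configuration does not start the server.
        return ManagedServer(model)

    def start(self, server: ManagedServer) -> None:
        server.start()

    def stop(self, server: ManagedServer) -> None:
        server.stop()


servers = ManagedServers()
server = servers.create("example-model/Q4_K_M")  # hypothetical model reference

# 1. Using the server object
server.start()
server.stop()

# 2. Using the accessor
servers.start(server)
servers.stop(server)

# 3. Using the server as a context manager
with servers.create("example-model/Q4_K_M") as active:
    was_running_inside = active.running
```

The context manager form is usually the safest of the three in scripts, because the stop step runs even if the body raises an exception.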