Added authorization and multi access to multiple servers
This commit is contained in:
parent
0a0cd9d2c1
commit
9d6c4b40f0
127
README.md
127
README.md
|
@ -1,63 +1,108 @@
|
||||||
# 🌺 **Petals Server**
|
# Ollama Proxy Server
|
||||||
|
|
||||||
![Petals Logo](<INSERT_IMAGE_PLACEHOLDER_HERE>.png)
|
Ollama Proxy Server is a lightweight reverse proxy server designed for load balancing and rate limiting. It is licensed under the Apache 2.0 license and can be installed using pip. This README covers setting up, installing, and using the Ollama Proxy Server.
|
||||||
|
|
||||||
**One decentralized tool for text generation and community collaboration**
|
## Prerequisites
|
||||||
|
Make sure you have Python (>=3.8) and Apache installed on your system before proceeding.
|
||||||
|
|
||||||
## Table of Contents
|
## Installation
|
||||||
1. [Introduction](#intro)
|
1. Clone or download the `ollama_proxy_server` repository from GitHub: https://github.com/ParisNeo/ollama_proxy_server
|
||||||
2. [Requirements](#requirements)
|
2. Navigate to the cloned directory in the terminal and run `pip install -e .`
|
||||||
3. [Installation](#installation)
|
|
||||||
4. [Usage](#usage)
|
|
||||||
5. [License](#license)
|
|
||||||
6. [Contact](#contact)
|
|
||||||
7. [Endpoints](#endpoints)
|
|
||||||
|
|
||||||
---
|
## Configuration
|
||||||
|
|
||||||
## 🌺 **Introduction** (<span id="intro">)</span>
|
### Servers configuration (config.ini)
|
||||||
|
Create a file named `config.ini` in the same directory as your script, containing server configurations:
|
||||||
|
```makefile
|
||||||
|
[Server1]
|
||||||
|
url = http://localhost:8080/
|
||||||
|
queue_size = 5
|
||||||
|
|
||||||
Petals is a decentralized text generation network designed to connect users with large language models, allowing them to harness the power of the community for efficient and collaborative text generation. With Petals Server, you can share your hardware resources (CPU and GPU) to contribute to the network while also utilizing it to generate text on demand.
|
[Server2]
|
||||||
|
url = http://localhost:8081/
|
||||||
|
queue_size = 3
|
||||||
|
|
||||||
---
|
# Add as many servers as needed, in the same format as [Server1] and [Server2].
|
||||||
|
```
|
||||||
|
Replace `http://localhost:8080/` with the URL and port of the first server. The `queue_size` value indicates the maximum number of requests that can be queued at a given time for this server.
|
||||||
|
|
||||||
## 🌺 **Requirements** (<span id="requirements">)</span>
|
### Authorized users (authorized_users.txt)
|
||||||
|
Create a file named `authorized_users.txt` in the same directory as your script, containing a list of user:key pairs, separated by commas and each on a new line:
|
||||||
|
```makefile
|
||||||
|
user1,key1
|
||||||
|
user2,key2
|
||||||
|
```
|
||||||
|
Replace `user1`, `key1`, `user2`, and `key2` with the desired username and API key for each user.
|
||||||
|
|
||||||
To get started with Petals Server, ensure you have the following prerequisites:
|
## Usage
|
||||||
- Git for cloning the repository
|
### Starting the server
|
||||||
- Python 3.11 or higher
|
Start the Ollama Proxy Server by running the following command in your terminal:
|
||||||
- Operating system: Linux, macOS, or Windows with WSL (Windows Subsystem for Linux)
|
```bash
|
||||||
|
python ollama_proxy_server.py
|
||||||
|
```
|
||||||
|
The server will listen on port 808x, with x being the number of available ports starting from 0 (e.g., 8080, 8081, etc.). The first available port will be automatically selected if no other instance is running.
|
||||||
|
|
||||||
---
|
### Client requests
|
||||||
|
To send a request to the server, use the following command:
|
||||||
|
```bash
|
||||||
|
curl -X <METHOD> -H "Authorization: Bearer <USER_KEY>" http://localhost:<PORT>/<PATH> [--data <POST_DATA>]
|
||||||
|
```
|
||||||
|
Replace `<METHOD>` with the HTTP method (GET or POST), `<USER_KEY>` with a valid user:key pair from your `authorized_users.txt`, `<PORT>` with the port number of your running Ollama Proxy Server, and `<PATH>` with the target endpoint URL (e.g., "/api/generate"). If you are making a POST request, include the `--data <POST_DATA>` option to send data in the body.
|
||||||
|
|
||||||
## 🌺 **Installation** (<span id="installation">)</span>
|
For example:
|
||||||
|
```bash
|
||||||
|
curl -X POST -H "Authorization: Bearer user1:key1" http://localhost:8080/api/generate --data '{"data": "Hello, World!"}'
|
||||||
|
``` # Ollama Proxy Server
|
||||||
|
|
||||||
Follow these steps to install Petals Server on your local machine:
|
Ollama Proxy Server is a lightweight reverse proxy server designed for load balancing and rate limiting. It is licensed under the Apache 2.0 license and can be installed using pip. This README covers setting up, installing, and using the Ollama Proxy Server.
|
||||||
1. Clone the Git repository using `git clone https://github.com/ParisNeo/petals_server.git`
|
|
||||||
2. Navigate into the cloned directory (`cd petals_server`)
|
|
||||||
3. Install dependencies with pip by running `pip install -e .`
|
|
||||||
4. Launch the server with `petals_server`
|
|
||||||
|
|
||||||
---
|
## Prerequisites
|
||||||
|
Make sure you have Python (>=3.8) and Apache installed on your system before proceeding.
|
||||||
|
|
||||||
## 🌺 **Usage** (<span id="usage">)</span>
|
## Installation
|
||||||
|
1. Clone or download the `ollama_proxy_server` repository from GitHub: https://github.com/ParisNeo/ollama_proxy_server
|
||||||
|
2. Navigate to the cloned directory in the terminal and run `pip install -e .`
|
||||||
|
|
||||||
Once installed, you can use Petals Server as a decentralized text generation client and contribute your hardware resources to the network.
|
## Configuration
|
||||||
|
|
||||||
---
|
### Servers configuration (config.ini)
|
||||||
|
Create a file named `config.ini` in the same directory as your script, containing server configurations:
|
||||||
|
```makefile
|
||||||
|
[Server1]
|
||||||
|
url = http://localhost:8080/
|
||||||
|
queue_size = 5
|
||||||
|
|
||||||
## 🌺 **License** (<span id="license">)</span>
|
[Server2]
|
||||||
|
url = http://localhost:8081/
|
||||||
|
queue_size = 3
|
||||||
|
|
||||||
Petals Server is licensed under the [Apache License v2.0](https://www.apache.org/licenses/LICENSE-2.0).
|
# Add as many servers as needed, in the same format as [Server1] and [Server2].
|
||||||
|
```
|
||||||
|
Replace `http://localhost:8080/` with the URL and port of the first server. The `queue_size` value indicates the maximum number of requests that can be queued at a given time for this server.
|
||||||
|
|
||||||
---
|
### Authorized users (authorized_users.txt)
|
||||||
|
Create a file named `authorized_users.txt` in the same directory as your script, containing a list of user:key pairs, separated by commas and each on a new line:
|
||||||
|
```makefile
|
||||||
|
user1:key1, user2:key2
|
||||||
|
```
|
||||||
|
Replace `user1`, `key1`, `user2`, and `key2` with the desired username and API key for each user.
|
||||||
|
|
||||||
## 🌺 **Contact** (<span id="contact">)</span>
|
## Usage
|
||||||
|
### Starting the server
|
||||||
|
Start the Ollama Proxy Server by running the following command in your terminal:
|
||||||
|
```bash
|
||||||
|
python ollama_proxy_server.py
|
||||||
|
```
|
||||||
|
The server will listen on port 808x, with x being the number of available ports starting from 0 (e.g., 8080, 8081, etc.). The first available port will be automatically selected if no other instance is running.
|
||||||
|
|
||||||
For any queries or feedback, reach out to ParisNeo on Twitter (@SpaceNerduino), Discord (https://discord.gg/BDxacQmv), or subscribe to the r/lollms Subreddit for community updates and discussions.
|
### Client requests
|
||||||
|
To send a request to the server, use the following command:
|
||||||
|
```bash
|
||||||
|
curl -X <METHOD> -H "Authorization: Bearer <USER_KEY>" http://localhost:<PORT>/<PATH> [--data <POST_DATA>]
|
||||||
|
```
|
||||||
|
Replace `<METHOD>` with the HTTP method (GET or POST), `<USER_KEY>` with a valid user:key pair from your `authorized_users.txt`, `<PORT>` with the port number of your running Ollama Proxy Server, and `<PATH>` with the target endpoint URL (e.g., "/api/generate"). If you are making a POST request, include the `--data <POST_DATA>` option to send data in the body.
|
||||||
|
|
||||||
---
|
For example:
|
||||||
|
```bash
|
||||||
## 🌺 **Endpoints** (<span id="endpoints">)</span>
|
curl -X POST -H "Authorization: Bearer user1:key1" http://localhost:8080/api/generate --data '{"data": "Hello, World!"}'
|
||||||
|
```
|
||||||
To explore all available endpoints, navigate to `http://localhost:8000/docs`.
|
|
|
@ -0,0 +1,3 @@
|
||||||
|
user1,0XAXAXAQX5A1F
|
||||||
|
user2,0XAXAXXQX5A1F
|
||||||
|
user3,0XAXAXAXX5A1F
|
|
@ -5,3 +5,4 @@ url = http://localhost:11434
|
||||||
url = http://localhost:3002
|
url = http://localhost:3002
|
||||||
|
|
||||||
# Add more servers as you need.
|
# Add more servers as you need.
|
||||||
|
|
||||||
|
|
|
@ -6,21 +6,34 @@ from queue import Queue
|
||||||
import requests
|
import requests
|
||||||
import threading
|
import threading
|
||||||
import argparse
|
import argparse
|
||||||
|
import base64
|
||||||
|
from ascii_colors import ASCIIColors
|
||||||
|
|
||||||
def get_config(filename):
|
def get_config(filename):
|
||||||
config = configparser.ConfigParser()
|
config = configparser.ConfigParser()
|
||||||
config.read(filename)
|
config.read(filename)
|
||||||
return [(name, {'url': config[name]['url'], 'queue': Queue()}) for name in config.sections()]
|
return [(name, {'url': config[name]['url'], 'queue': Queue()}) for name in config.sections()]
|
||||||
|
|
||||||
|
# Read the authorized users and their keys from a file
|
||||||
|
def get_authorized_users(filename):
|
||||||
|
with open(filename, 'r') as f:
|
||||||
|
lines = f.readlines()
|
||||||
|
authorized_users = {}
|
||||||
|
for line in lines:
|
||||||
|
user, key = line.strip().split(',')
|
||||||
|
authorized_users[user] = key
|
||||||
|
return authorized_users
|
||||||
|
|
||||||
|
|
||||||
|
|
||||||
def main():
|
def main():
|
||||||
parser = argparse.ArgumentParser()
|
parser = argparse.ArgumentParser()
|
||||||
parser.add_argument('--config',default="config.ini", help='Path to the config file') # , required=True
|
parser.add_argument('--config',default="config.ini", help='Path to the authorized users list')
|
||||||
|
parser.add_argument('--users_list', default="authorized_users.txt", help='Path to the config file')
|
||||||
parser.add_argument('--port', type=int, default=8000, help='Port number for the server')
|
parser.add_argument('--port', type=int, default=8000, help='Port number for the server')
|
||||||
args = parser.parse_args()
|
args = parser.parse_args()
|
||||||
servers = get_config(args.config)
|
servers = get_config(args.config)
|
||||||
|
authorized_users = get_authorized_users(args.users_list)
|
||||||
|
|
||||||
class RequestHandler(BaseHTTPRequestHandler):
|
class RequestHandler(BaseHTTPRequestHandler):
|
||||||
|
|
||||||
|
@ -31,16 +44,35 @@ def main():
|
||||||
self.wfile.write(response.content)
|
self.wfile.write(response.content)
|
||||||
|
|
||||||
def do_GET(self):
|
def do_GET(self):
|
||||||
|
self.log_request()
|
||||||
self.proxy()
|
self.proxy()
|
||||||
|
|
||||||
def do_POST(self):
|
def do_POST(self):
|
||||||
|
self.log_request()
|
||||||
self.proxy()
|
self.proxy()
|
||||||
|
|
||||||
|
def _validate_user_and_key(self):
|
||||||
|
# Extract the bearer token from the headers
|
||||||
|
auth_header = self.headers.get('Authorization')
|
||||||
|
if not auth_header or not auth_header.startswith('Bearer '):
|
||||||
|
return False
|
||||||
|
token = auth_header.split(' ')[1]
|
||||||
|
user, key = token.split(':')
|
||||||
|
|
||||||
|
# Check if the user and key are in the list of authorized users
|
||||||
|
return authorized_users.get(user) == key
|
||||||
|
|
||||||
def proxy(self):
|
def proxy(self):
|
||||||
|
if not self._validate_user_and_key():
|
||||||
|
ASCIIColors.red(f'User is not authorized')
|
||||||
|
self.send_response(403)
|
||||||
|
self.end_headers()
|
||||||
|
return
|
||||||
url = urlparse(self.path)
|
url = urlparse(self.path)
|
||||||
path = url.path
|
path = url.path
|
||||||
get_params = parse_qs(url.query) or {}
|
get_params = parse_qs(url.query) or {}
|
||||||
|
|
||||||
|
|
||||||
if self.command == "POST":
|
if self.command == "POST":
|
||||||
content_length = int(self.headers['Content-Length'])
|
content_length = int(self.headers['Content-Length'])
|
||||||
post_data = self.rfile.read(content_length)
|
post_data = self.rfile.read(content_length)
|
||||||
|
|
|
@ -3,3 +3,4 @@ queues
|
||||||
requests
|
requests
|
||||||
urllib3
|
urllib3
|
||||||
requests
|
requests
|
||||||
|
ascii_colors
|
|
@ -4,3 +4,4 @@ requests==2.27.1
|
||||||
socketserver==3.5.0
|
socketserver==3.5.0
|
||||||
urllib3==1.26.8
|
urllib3==1.26.8
|
||||||
requests
|
requests
|
||||||
|
ascii_colors
|
Loading…
Reference in New Issue