Added authorization and multi access to multiple servers

This commit is contained in:
Saifeddine ALOUI 2024-01-16 01:44:22 +01:00
parent 0a0cd9d2c1
commit 9d6c4b40f0
6 changed files with 128 additions and 45 deletions

127
README.md
View File

@ -1,63 +1,108 @@
# 🌺 **Petals Server** # Ollama Proxy Server
![Petals Logo](<INSERT_IMAGE_PLACEHOLDER_HERE>.png) Ollama Proxy Server is a lightweight reverse proxy server designed for load balancing and rate limiting. It is licensed under the Apache 2.0 license and can be installed using pip. This README covers setting up, installing, and using the Ollama Proxy Server.
**One decentralized tool for text generation and community collaboration** ## Prerequisites
Make sure you have Python (>=3.8) and Apache installed on your system before proceeding.
## Table of Contents ## Installation
1. [Introduction](#intro) 1. Clone or download the `ollama_proxy_server` repository from GitHub: https://github.com/ParisNeo/ollama_proxy_server
2. [Requirements](#requirements) 2. Navigate to the cloned directory in the terminal and run `pip install -e .`
3. [Installation](#installation)
4. [Usage](#usage)
5. [License](#license)
6. [Contact](#contact)
7. [Endpoints](#endpoints)
--- ## Configuration
## 🌺 **Introduction** (<span id="intro">)</span> ### Servers configuration (config.ini)
Create a file named `config.ini` in the same directory as your script, containing server configurations:
```makefile
[Server1]
url = http://localhost:8080/
queue_size = 5
Petals is a decentralized text generation network designed to connect users with large language models, allowing them to harness the power of the community for efficient and collaborative text generation. With Petals Server, you can share your hardware resources (CPU and GPU) to contribute to the network while also utilizing it to generate text on demand. [Server2]
url = http://localhost:8081/
queue_size = 3
--- # Add as many servers as needed, in the same format as [Server1] and [Server2].
```
Replace `http://localhost:8080/` with the URL and port of the first server. The `queue_size` value indicates the maximum number of requests that can be queued at a given time for this server.
## 🌺 **Requirements** (<span id="requirements">)</span> ### Authorized users (authorized_users.txt)
Create a file named `authorized_users.txt` in the same directory as your script, containing a list of user:key pairs, separated by commas and each on a new line:
```makefile
user1,key1
user2,key2
```
Replace `user1`, `key1`, `user2`, and `key2` with the desired username and API key for each user.
To get started with Petals Server, ensure you have the following prerequisites: ## Usage
- Git for cloning the repository ### Starting the server
- Python 3.11 or higher Start the Ollama Proxy Server by running the following command in your terminal:
- Operating system: Linux, macOS, or Windows with WSL (Windows Subsystem for Linux) ```bash
python ollama_proxy_server.py
```
The server will listen on port 808x, with x being the number of available ports starting from 0 (e.g., 8080, 8081, etc.). The first available port will be automatically selected if no other instance is running.
--- ### Client requests
To send a request to the server, use the following command:
```bash
curl -X <METHOD> -H "Authorization: Bearer <USER_KEY>" http://localhost:<PORT>/<PATH> [--data <POST_DATA>]
```
Replace `<METHOD>` with the HTTP method (GET or POST), `<USER_KEY>` with a valid user:key pair from your `authorized_users.txt`, `<PORT>` with the port number of your running Ollama Proxy Server, and `<PATH>` with the target endpoint URL (e.g., "/api/generate"). If you are making a POST request, include the `--data <POST_DATA>` option to send data in the body.
## 🌺 **Installation** (<span id="installation">)</span> For example:
```bash
curl -X POST -H "Authorization: Bearer user1:key1" http://localhost:8080/api/generate --data '{"data": "Hello, World!"}'
``` # Ollama Proxy Server
Follow these steps to install Petals Server on your local machine: Ollama Proxy Server is a lightweight reverse proxy server designed for load balancing and rate limiting. It is licensed under the Apache 2.0 license and can be installed using pip. This README covers setting up, installing, and using the Ollama Proxy Server.
1. Clone the Git repository using `git clone https://github.com/ParisNeo/petals_server.git`
2. Navigate into the cloned directory (`cd petals_server`)
3. Install dependencies with pip by running `pip install -e .`
4. Launch the server with `petals_server`
--- ## Prerequisites
Make sure you have Python (>=3.8) and Apache installed on your system before proceeding.
## 🌺 **Usage** (<span id="usage">)</span> ## Installation
1. Clone or download the `ollama_proxy_server` repository from GitHub: https://github.com/ParisNeo/ollama_proxy_server
2. Navigate to the cloned directory in the terminal and run `pip install -e .`
Once installed, you can use Petals Server as a decentralized text generation client and contribute your hardware resources to the network. ## Configuration
--- ### Servers configuration (config.ini)
Create a file named `config.ini` in the same directory as your script, containing server configurations:
```makefile
[Server1]
url = http://localhost:8080/
queue_size = 5
## 🌺 **License** (<span id="license">)</span> [Server2]
url = http://localhost:8081/
queue_size = 3
Petals Server is licensed under the [Apache License v2.0](https://www.apache.org/licenses/LICENSE-2.0). # Add as many servers as needed, in the same format as [Server1] and [Server2].
```
Replace `http://localhost:8080/` with the URL and port of the first server. The `queue_size` value indicates the maximum number of requests that can be queued at a given time for this server.
--- ### Authorized users (authorized_users.txt)
Create a file named `authorized_users.txt` in the same directory as your script, containing a list of user:key pairs, separated by commas and each on a new line:
```makefile
user1:key1, user2:key2
```
Replace `user1`, `key1`, `user2`, and `key2` with the desired username and API key for each user.
## 🌺 **Contact** (<span id="contact">)</span> ## Usage
### Starting the server
Start the Ollama Proxy Server by running the following command in your terminal:
```bash
python ollama_proxy_server.py
```
The server will listen on port 808x, with x being the number of available ports starting from 0 (e.g., 8080, 8081, etc.). The first available port will be automatically selected if no other instance is running.
For any queries or feedback, reach out to ParisNeo on Twitter (@SpaceNerduino), Discord (https://discord.gg/BDxacQmv), or subscribe to the r/lollms Subreddit for community updates and discussions. ### Client requests
To send a request to the server, use the following command:
```bash
curl -X <METHOD> -H "Authorization: Bearer <USER_KEY>" http://localhost:<PORT>/<PATH> [--data <POST_DATA>]
```
Replace `<METHOD>` with the HTTP method (GET or POST), `<USER_KEY>` with a valid user:key pair from your `authorized_users.txt`, `<PORT>` with the port number of your running Ollama Proxy Server, and `<PATH>` with the target endpoint URL (e.g., "/api/generate"). If you are making a POST request, include the `--data <POST_DATA>` option to send data in the body.
--- For example:
```bash
## 🌺 **Endpoints** (<span id="endpoints">)</span> curl -X POST -H "Authorization: Bearer user1:key1" http://localhost:8080/api/generate --data '{"data": "Hello, World!"}'
```
To explore all available endpoints, navigate to `http://localhost:8000/docs`.

3
authorized_users.txt Normal file
View File

@ -0,0 +1,3 @@
user1,0XAXAXAQX5A1F
user2,0XAXAXXQX5A1F
user3,0XAXAXAXX5A1F

View File

@ -5,3 +5,4 @@ url = http://localhost:11434
url = http://localhost:3002 url = http://localhost:3002
# Add more servers as you need. # Add more servers as you need.

View File

@ -6,21 +6,34 @@ from queue import Queue
import requests import requests
import threading import threading
import argparse import argparse
import base64
from ascii_colors import ASCIIColors
def get_config(filename): def get_config(filename):
config = configparser.ConfigParser() config = configparser.ConfigParser()
config.read(filename) config.read(filename)
return [(name, {'url': config[name]['url'], 'queue': Queue()}) for name in config.sections()] return [(name, {'url': config[name]['url'], 'queue': Queue()}) for name in config.sections()]
# Read the authorized users and their keys from a file
def get_authorized_users(filename):
with open(filename, 'r') as f:
lines = f.readlines()
authorized_users = {}
for line in lines:
user, key = line.strip().split(',')
authorized_users[user] = key
return authorized_users
def main(): def main():
parser = argparse.ArgumentParser() parser = argparse.ArgumentParser()
parser.add_argument('--config',default="config.ini", help='Path to the config file') # , required=True parser.add_argument('--config',default="config.ini", help='Path to the authorized users list')
parser.add_argument('--users_list', default="authorized_users.txt", help='Path to the config file')
parser.add_argument('--port', type=int, default=8000, help='Port number for the server') parser.add_argument('--port', type=int, default=8000, help='Port number for the server')
args = parser.parse_args() args = parser.parse_args()
servers = get_config(args.config) servers = get_config(args.config)
authorized_users = get_authorized_users(args.users_list)
class RequestHandler(BaseHTTPRequestHandler): class RequestHandler(BaseHTTPRequestHandler):
@ -31,16 +44,35 @@ def main():
self.wfile.write(response.content) self.wfile.write(response.content)
def do_GET(self): def do_GET(self):
self.log_request()
self.proxy() self.proxy()
def do_POST(self): def do_POST(self):
self.log_request()
self.proxy() self.proxy()
def _validate_user_and_key(self):
# Extract the bearer token from the headers
auth_header = self.headers.get('Authorization')
if not auth_header or not auth_header.startswith('Bearer '):
return False
token = auth_header.split(' ')[1]
user, key = token.split(':')
# Check if the user and key are in the list of authorized users
return authorized_users.get(user) == key
def proxy(self): def proxy(self):
if not self._validate_user_and_key():
ASCIIColors.red(f'User is not authorized')
self.send_response(403)
self.end_headers()
return
url = urlparse(self.path) url = urlparse(self.path)
path = url.path path = url.path
get_params = parse_qs(url.query) or {} get_params = parse_qs(url.query) or {}
if self.command == "POST": if self.command == "POST":
content_length = int(self.headers['Content-Length']) content_length = int(self.headers['Content-Length'])
post_data = self.rfile.read(content_length) post_data = self.rfile.read(content_length)

View File

@ -3,3 +3,4 @@ queues
requests requests
urllib3 urllib3
requests requests
ascii_colors

View File

@ -4,3 +4,4 @@ requests==2.27.1
socketserver==3.5.0 socketserver==3.5.0
urllib3==1.26.8 urllib3==1.26.8
requests requests
ascii_colors