Added authorization and multi access to multiple servers

2024-01-16 01:44:22 +01:00 · 2024-01-16 01:44:22 +01:00 · 9d6c4b40f0
parent 0a0cd9d2c1
commit 9d6c4b40f0
6 changed files with 128 additions and 45 deletions
--- a/README.md
+++ b/README.md
@ -1,63 +1,108 @@
-# 🌺 **Petals Server**
+# Ollama Proxy Server
-![Petals Logo](<INSERT_IMAGE_PLACEHOLDER_HERE>.png)
+Ollama Proxy Server is a lightweight reverse proxy server designed for load balancing and rate limiting. It is licensed under the Apache 2.0 license and can be installed using pip. This README covers setting up, installing, and using the Ollama Proxy Server.
-**One decentralized tool for text generation and community collaboration**
+## Prerequisites
 Make sure you have Python (>=3.8) and Apache installed on your system before proceeding.
-## Table of Contents
+## Installation
-1. [Introduction](#intro)
+1. Clone or download the `ollama_proxy_server` repository from GitHub: https://github.com/ParisNeo/ollama_proxy_server
-2. [Requirements](#requirements)
+2. Navigate to the cloned directory in the terminal and run `pip install -e .`
 3. [Installation](#installation)
 4. [Usage](#usage)
 5. [License](#license)
 6. [Contact](#contact)
 7. [Endpoints](#endpoints)
---
+## Configuration
-## 🌺 **Introduction** (<span id="intro">)</span>
+### Servers configuration (config.ini)
 Create a file named `config.ini` in the same directory as your script, containing server configurations:
 ```makefile
 [Server1]
 url = http://localhost:8080/
 queue_size = 5
-Petals is a decentralized text generation network designed to connect users with large language models, allowing them to harness the power of the community for efficient and collaborative text generation. With Petals Server, you can share your hardware resources (CPU and GPU) to contribute to the network while also utilizing it to generate text on demand.
+[Server2]
 url = http://localhost:8081/
 queue_size = 3
---
+# Add as many servers as needed, in the same format as [Server1] and [Server2].
 ```
 Replace `http://localhost:8080/` with the URL and port of the first server. The `queue_size` value indicates the maximum number of requests that can be queued at a given time for this server.
-## 🌺 **Requirements** (<span id="requirements">)</span>
+### Authorized users (authorized_users.txt)
 Create a file named `authorized_users.txt` in the same directory as your script, containing a list of user:key pairs, separated by commas and each on a new line:
 ```makefile
 user1,key1
 user2,key2
 ```
 Replace `user1`, `key1`, `user2`, and `key2` with the desired username and API key for each user.
-To get started with Petals Server, ensure you have the following prerequisites:
+## Usage
- Git for cloning the repository
+### Starting the server
- Python 3.11 or higher
+Start the Ollama Proxy Server by running the following command in your terminal:
- Operating system: Linux, macOS, or Windows with WSL (Windows Subsystem for Linux)
+```bash
 python ollama_proxy_server.py
 ```
 The server will listen on port 808x, with x being the number of available ports starting from 0 (e.g., 8080, 8081, etc.). The first available port will be automatically selected if no other instance is running.
---
+### Client requests
 To send a request to the server, use the following command:
 ```bash
 curl -X <METHOD> -H "Authorization: Bearer <USER_KEY>" http://localhost:<PORT>/<PATH> [--data <POST_DATA>]
 ```
 Replace `<METHOD>` with the HTTP method (GET or POST), `<USER_KEY>` with a valid user:key pair from your `authorized_users.txt`, `<PORT>` with the port number of your running Ollama Proxy Server, and `<PATH>` with the target endpoint URL (e.g., "/api/generate"). If you are making a POST request, include the `--data <POST_DATA>` option to send data in the body.
-## 🌺 **Installation** (<span id="installation">)</span>
+For example:
 ```bash
 curl -X POST -H "Authorization: Bearer user1:key1" http://localhost:8080/api/generate --data '{"data": "Hello, World!"}'
 ``` # Ollama Proxy Server
-Follow these steps to install Petals Server on your local machine:
+Ollama Proxy Server is a lightweight reverse proxy server designed for load balancing and rate limiting. It is licensed under the Apache 2.0 license and can be installed using pip. This README covers setting up, installing, and using the Ollama Proxy Server.
 1. Clone the Git repository using `git clone https://github.com/ParisNeo/petals_server.git`
 2. Navigate into the cloned directory (`cd petals_server`)
 3. Install dependencies with pip by running `pip install -e .`
 4. Launch the server with `petals_server`
---
+## Prerequisites
 Make sure you have Python (>=3.8) and Apache installed on your system before proceeding.
-## 🌺 **Usage** (<span id="usage">)</span>
+## Installation
 1. Clone or download the `ollama_proxy_server` repository from GitHub: https://github.com/ParisNeo/ollama_proxy_server
 2. Navigate to the cloned directory in the terminal and run `pip install -e .`
-Once installed, you can use Petals Server as a decentralized text generation client and contribute your hardware resources to the network.
+## Configuration
---
+### Servers configuration (config.ini)
 Create a file named `config.ini` in the same directory as your script, containing server configurations:
 ```makefile
 [Server1]
 url = http://localhost:8080/
 queue_size = 5
-## 🌺 **License** (<span id="license">)</span>
+[Server2]
 url = http://localhost:8081/
 queue_size = 3
-Petals Server is licensed under the [Apache License v2.0](https://www.apache.org/licenses/LICENSE-2.0).
+# Add as many servers as needed, in the same format as [Server1] and [Server2].
 ```
 Replace `http://localhost:8080/` with the URL and port of the first server. The `queue_size` value indicates the maximum number of requests that can be queued at a given time for this server.
---
+### Authorized users (authorized_users.txt)
 Create a file named `authorized_users.txt` in the same directory as your script, containing a list of user:key pairs, separated by commas and each on a new line:
 ```makefile
 user1:key1, user2:key2
 ```
 Replace `user1`, `key1`, `user2`, and `key2` with the desired username and API key for each user.
-## 🌺 **Contact** (<span id="contact">)</span>
+## Usage
 ### Starting the server
 Start the Ollama Proxy Server by running the following command in your terminal:
 ```bash
 python ollama_proxy_server.py
 ```
 The server will listen on port 808x, with x being the number of available ports starting from 0 (e.g., 8080, 8081, etc.). The first available port will be automatically selected if no other instance is running.
-For any queries or feedback, reach out to ParisNeo on Twitter (@SpaceNerduino), Discord (https://discord.gg/BDxacQmv), or subscribe to the r/lollms Subreddit for community updates and discussions.
+### Client requests
 To send a request to the server, use the following command:
 ```bash
 curl -X <METHOD> -H "Authorization: Bearer <USER_KEY>" http://localhost:<PORT>/<PATH> [--data <POST_DATA>]
 ```
 Replace `<METHOD>` with the HTTP method (GET or POST), `<USER_KEY>` with a valid user:key pair from your `authorized_users.txt`, `<PORT>` with the port number of your running Ollama Proxy Server, and `<PATH>` with the target endpoint URL (e.g., "/api/generate"). If you are making a POST request, include the `--data <POST_DATA>` option to send data in the body.
---
+For example:
-
+```bash
-## 🌺 **Endpoints** (<span id="endpoints">)</span>
+curl -X POST -H "Authorization: Bearer user1:key1" http://localhost:8080/api/generate --data '{"data": "Hello, World!"}'
-
+```
 To explore all available endpoints, navigate to `http://localhost:8000/docs`.
--- a/authorized_users.txt
+++ b/authorized_users.txt
@ -0,0 +1,3 @@
 user1,0XAXAXAQX5A1F
 user2,0XAXAXXQX5A1F
 user3,0XAXAXAXX5A1F
--- a/config.ini
+++ b/config.ini
@ -5,3 +5,4 @@ url = http://localhost:11434
 url = http://localhost:3002
 # Add more servers as you need.
--- a/ollama_proxy_server/main.py
+++ b/ollama_proxy_server/main.py
@ -6,21 +6,34 @@ from queue import Queue
 import requests
 import threading
 import argparse
-
+import base64
 from ascii_colors import ASCIIColors
 def get_config(filename):
    config = configparser.ConfigParser()
    config.read(filename)
    return [(name, {'url': config[name]['url'], 'queue': Queue()}) for name in config.sections()]
 # Read the authorized users and their keys from a file
 def get_authorized_users(filename):
    with open(filename, 'r') as f:
        lines = f.readlines()
    authorized_users = {}
    for line in lines:
        user, key = line.strip().split(',')
        authorized_users[user] = key
    return authorized_users
 def main():
    parser = argparse.ArgumentParser()
-    parser.add_argument('--config',default="config.ini", help='Path to the config file') # , required=True
+    parser.add_argument('--config',default="config.ini", help='Path to the authorized users list')
    parser.add_argument('--users_list', default="authorized_users.txt", help='Path to the config file')
    parser.add_argument('--port', type=int, default=8000, help='Port number for the server')
    args = parser.parse_args()
    servers = get_config(args.config)  
    authorized_users = get_authorized_users(args.users_list)
    class RequestHandler(BaseHTTPRequestHandler):
@ -31,16 +44,35 @@ def main():
            self.wfile.write(response.content)
        def do_GET(self):
            self.log_request()
            self.proxy()
        def do_POST(self):
            self.log_request()
            self.proxy()
        def _validate_user_and_key(self):
            # Extract the bearer token from the headers
            auth_header = self.headers.get('Authorization')
            if not auth_header or not auth_header.startswith('Bearer '):
                return False
            token = auth_header.split(' ')[1]
            user, key = token.split(':')
            # Check if the user and key are in the list of authorized users
            return authorized_users.get(user) == key
        def proxy(self):
            if not self._validate_user_and_key():
                ASCIIColors.red(f'User is not authorized')
                self.send_response(403)
                self.end_headers()
                return            
            url = urlparse(self.path)
            path = url.path
            get_params = parse_qs(url.query) or {}
            if self.command == "POST":
                content_length = int(self.headers['Content-Length'])
                post_data = self.rfile.read(content_length)
--- a/requirements.txt
+++ b/requirements.txt
@ -2,4 +2,5 @@ configparser
 queues
 requests
 urllib3
-requests
+requests
 ascii_colors
--- a/requirements_dev.txt
+++ b/requirements_dev.txt
@ -3,4 +3,5 @@ queues==6.4.0
 requests==2.27.1
 socketserver==3.5.0
 urllib3==1.26.8
-requests
+requests
 ascii_colors
`@ -5,3 +5,4 @@ url = http://localhost:11434`
	`url = http://localhost:3002`	`url = http://localhost:3002`

	`# Add more servers as you need.`	`# Add more servers as you need.`