Spaces:

anushadudi
/

openenv-number-guessing-env

Sleeping

App Files Files Community

openenv-number-guessing-env / README.md

anushadudi

Upload folder using huggingface_hub

1747df2 verified about 1 month ago

preview code

raw

history blame contribute delete

4.16 kB

metadata

title: Number Guessing Game Environment
emoji: 🎯
colorFrom: blue
colorTo: purple
sdk: docker
pinned: false
app_port: 8000
tags:
  - openenv
  - reinforcement-learning
  - game
  - binary-search
base_path: /web

Number Guessing Game Environment

A simple OpenEnv environment where an agent learns to guess a secret number between 1 and 100 with limited attempts.

Description

The agent receives hints after each guess ("higher", "lower", or "correct") and must find the secret number within 10 attempts. This environment is perfect for:

Teaching RL agents binary search strategies
Learning the OpenEnv framework
Benchmarking simple reasoning capabilities

Environment Details

Action Space:

GuessAction with a single field:
- guess: Integer between 1 and 100

Observation Space:

GuessObservation with fields:
- hint: String ("correct", "higher", "lower", or "invalid")
- attempts_remaining: Number of guesses left
- guess_history: List of all previous guesses
- done: Boolean indicating if episode is complete
- reward: Float reward for the action

Reward Structure:

+10.0: Correct guess (episode ends)
+0.1: Valid guess that narrows the range
-1.0: Invalid guess (out of bounds)
-5.0: Failed to guess within max attempts

Episode Termination:

Agent guesses correctly
Agent runs out of attempts (10 by default)

Quick Start

Using the Client

from envs.number_guess_env import NumberGuessEnv, GuessAction

# Connect to a running server
client = NumberGuessEnv(base_url="http://localhost:8000")

# Or use Docker (automatically starts container)
# client = NumberGuessEnv.from_docker_image("number-guess-env:latest")

# Start a new game
result = client.reset()
print(result.observation.hint)  # "Guess a number between 1 and 100!"

# Make guesses
result = client.step(GuessAction(guess=50))
print(f"Hint: {result.observation.hint}")
print(f"Reward: {result.reward}")
print(f"Attempts left: {result.observation.attempts_remaining}")

# Continue until done
while not result.done:
    # Your agent logic here
    guess = 75  # Example
    result = client.step(GuessAction(guess=guess))
    print(f"Hint: {result.observation.hint}")

client.close()

Training an Agent

from envs.number_guess_env import NumberGuessEnv, GuessAction

env = NumberGuessEnv.from_docker_image("number-guess-env:latest")

for episode in range(100):
    result = env.reset()
    total_reward = 0

    # Simple binary search strategy
    low, high = 1, 100

    while not result.done:
        guess = (low + high) // 2
        result = env.step(GuessAction(guess=guess))
        total_reward += result.reward

        if result.observation.hint == "higher":
            low = guess + 1
        elif result.observation.hint == "lower":
            high = guess - 1

    print(f"Episode {episode}: Total reward = {total_reward}")

env.close()

Building and Running

Build Docker Image

docker build -t number-guess-env:latest server/

Run Server Locally

# Using uvicorn directly
cd server
uvicorn app:app --host 0.0.0.0 --port 8000

# Or using Docker
docker run -p 8000:8000 number-guess-env:latest

Test the Server

# Reset
curl -X POST http://localhost:8000/reset

# Step
curl -X POST http://localhost:8000/step \
  -H "Content-Type: application/json" \
  -d '{"guess": 50}'

# Get state
curl http://localhost:8000/state

Environment Customization

You can customize the environment parameters:

from envs.number_guess_env.server.number_guess_environment import NumberGuessEnvironment

# Custom range and attempts
env = NumberGuessEnvironment(
    max_attempts=15,
    min_number=1,
    max_number=1000
)

API Endpoints

When running as a server, the following endpoints are available:

POST /reset - Start a new game with a new secret number
POST /step - Submit a guess and receive a hint
GET /state - Get current episode state (episode_id, step_count)
GET /health - Health check endpoint
GET / - API documentation

License

BSD 3-Clause License (see LICENSE file)