From 7b391aead46a90916c7fb952d9017120d814b3ed Mon Sep 17 00:00:00 2001 From: Asankhaya Sharma Date: Wed, 17 Sep 2025 10:56:07 +0800 Subject: [PATCH 1/6] Update README.md --- README.md | 213 +++++++++++++++++++++++++++++++++++++++++++++++++++++- 1 file changed, 212 insertions(+), 1 deletion(-) diff --git a/README.md b/README.md index 723027ec3..3aaea0ef7 100644 --- a/README.md +++ b/README.md @@ -15,7 +15,7 @@ License

-[๐Ÿš€ **Quick Start**](#-quick-start) โ€ข [๐Ÿ“– **Examples**](#-examples-gallery) โ€ข [๐Ÿ’ฌ **Discussions**](https://github.com/codelion/openevolve/discussions) +[๐Ÿš€ **Quick Start**](#-quick-start) โ€ข [๐Ÿ“– **Examples**](#-examples-gallery) โ€ข [๐Ÿ“ **System Messages**](#-crafting-effective-system-messages) โ€ข [๐Ÿ’ฌ **Discussions**](https://github.com/codelion/openevolve/discussions) *From random search to state-of-the-art: Watch your code evolve in real-time* @@ -516,6 +516,217 @@ See [prompt examples](examples/llm_prompt_optimization/templates/) for complete +## ๐Ÿ“ Crafting Effective System Messages + +**System messages are the secret to successful evolution.** They guide the LLM's understanding of your domain, constraints, and optimization goals. A well-crafted system message can be the difference between random mutations and targeted improvements. + +### Why System Messages Matter + +The system message in your config.yaml is arguably the most important component for evolution success: + +- **Domain Expertise**: Provides LLM with specific knowledge about your problem space +- **Constraint Awareness**: Defines what can and cannot be changed during evolution +- **Optimization Focus**: Guides the LLM toward meaningful improvements +- **Error Prevention**: Helps avoid common pitfalls and compilation errors + +### The Iterative Creation Process + +Based on successful OpenEvolve implementations, system messages are best created through iteration: + +
+๐Ÿ”„ Step-by-Step Process + +**Phase 1: Initial Draft** +1. Start with a basic system message describing your goal +2. Run 20-50 evolution iterations to observe behavior +3. Note where the system gets "stuck" or makes poor choices + +**Phase 2: Refinement** +4. Add specific guidance based on observed issues +5. Include domain-specific terminology and concepts +6. Define clear constraints and optimization targets +7. Run another batch of iterations + +**Phase 3: Specialization** +8. Add detailed examples of good vs bad approaches +9. Include specific library/framework guidance +10. Add error avoidance patterns you've observed +11. Fine-tune based on artifact feedback + +**Phase 4: Optimization** +12. Consider using OpenEvolve itself to optimize your prompt +13. Measure improvements using combined score metrics + +
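
For a concrete starting point, here is a rough sketch of how this loop can be driven from a script. It shells out to `openevolve-run.py` as in the Quick Start; the `--config` flag and the phase messages below are illustrative assumptions, not a fixed recipe.

```python
# Hedged sketch of the phased workflow above -- adapt paths and flags to your setup.
import subprocess
import yaml

# Phase 1 starts simple; later phases fold in what you learned from earlier runs.
PHASE_MESSAGES = [
    "You are an expert programmer. Improve this function minimization algorithm.",
    "You are an expert in optimization algorithms. Escape local minima via "
    "restarts or annealing. Do not change the function signature.",
]

for phase, system_message in enumerate(PHASE_MESSAGES, start=1):
    # Write the refined system message into the config used for this batch
    with open("config.yaml", "w") as f:
        yaml.safe_dump({"prompt": {"system_message": system_message}}, f)
    subprocess.run(
        [
            "python", "openevolve-run.py",
            "examples/function_minimization/initial_program.py",
            "examples/function_minimization/evaluator.py",
            "--config", "config.yaml",
            "--iterations", "50",  # short batches, per the Phase 1 guidance
        ],
        check=True,
    )
    print(f"Phase {phase} done -- inspect the output before refining the next message.")
```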
+ +### Examples by Complexity + +#### ๐ŸŽฏ **Simple: General Optimization** +```yaml +prompt: + system_message: | + You are an expert programmer specializing in optimization algorithms. + Your task is to improve a function minimization algorithm to find the + global minimum reliably, escaping local minima that might trap simple algorithms. +``` + +#### ๐Ÿ”ง **Intermediate: Domain-Specific Guidance** +```yaml +prompt: + system_message: | + You are an expert prompt engineer. Your task is to revise prompts for LLMs. + + Your improvements should: + * Clarify vague instructions and eliminate ambiguity + * Strengthen alignment between prompt and desired task outcome + * Improve robustness against edge cases + * Include formatting instructions and examples where helpful + * Avoid unnecessary verbosity + + Return only the improved prompt text without explanations. +``` + +#### โšก **Advanced: Hardware-Specific Optimization** +```yaml +prompt: + system_message: | + You are an expert Metal GPU programmer specializing in custom attention + kernels for Apple Silicon. + + # TARGET: Optimize Metal Kernel for Grouped Query Attention (GQA) + # HARDWARE: Apple M-series GPUs with unified memory architecture + # GOAL: 5-15% performance improvement + + # OPTIMIZATION OPPORTUNITIES: + **1. Memory Access Pattern Optimization:** + - Coalesced access patterns for Apple Silicon + - Vectorized loading using SIMD + - Pre-compute frequently used indices + + **2. Algorithm Fusion:** + - Combine max finding with score computation + - Reduce number of passes through data + + # CONSTRAINTS - CRITICAL SAFETY RULES: + **MUST NOT CHANGE:** + โŒ Kernel function signature + โŒ Template parameter names or types + โŒ Overall algorithm correctness + + **ALLOWED TO OPTIMIZE:** + โœ… Memory access patterns and indexing + โœ… Computation order and efficiency + โœ… Vectorization and SIMD utilization + โœ… Apple Silicon specific optimizations +``` + +### Best Practices + +
+๐ŸŽจ Prompt Engineering Patterns + +**Structure Your Message:** +- Start with role definition ("You are an expert...") +- Define the specific task and context +- List optimization opportunities with examples +- Set clear constraints and safety rules +- End with success criteria + +**Use Specific Examples:** +```yaml +# Good: Specific optimization targets +system_message: | + Focus on reducing memory allocations in the hot loop. + Example: Replace `new Vector()` with pre-allocated arrays. + +# Avoid: Vague guidance +system_message: "Make the code faster" +``` + +**Include Domain Knowledge:** +```yaml +# Good: Domain-specific guidance +system_message: | + For GPU kernels, prioritize: + 1. Memory coalescing (access patterns) + 2. Occupancy (thread utilization) + 3. Shared memory usage (cache blocking) + +# Avoid: Generic optimization advice +system_message: "Optimize the algorithm" +``` + +**Set Clear Boundaries:** +```yaml +system_message: | + MUST NOT CHANGE: + โŒ Function signatures + โŒ Algorithm correctness + โŒ External API compatibility + + ALLOWED TO OPTIMIZE: + โœ… Internal implementation details + โœ… Data structures and algorithms + โœ… Performance optimizations +``` + +
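
To see the "structure your message" pattern end to end, the hedged sketch below assembles a system message from the parts listed above and writes it into the `prompt.system_message` field of a config file. The section texts are placeholders for your own domain.

```python
# Minimal sketch: compose a structured system message from the pattern above.
import yaml

sections = [
    "You are an expert Metal GPU programmer.",                      # role definition
    "Optimize this attention kernel for Apple Silicon.",            # task + context
    "Opportunities: coalesced memory access, SIMD vectorization.",  # optimization targets
    "MUST NOT CHANGE: kernel signature, algorithm correctness.",    # constraints
    "Success: 5-15% faster with identical outputs.",                # success criteria
]

with open("config.yaml", "w") as f:
    yaml.safe_dump({"prompt": {"system_message": "\n\n".join(sections)}}, f)
```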
+ +
+๐Ÿ”ฌ Advanced Techniques + +**Artifact-Driven Iteration:** +- Enable artifacts in your config +- Include common error patterns in system message +- Add guidance based on stderr/warning patterns + +**Multi-Phase Evolution:** +```yaml +# Phase 1: Broad exploration +system_message: "Explore different algorithmic approaches..." + +# Phase 2: Focused optimization +system_message: "Given the successful simulated annealing approach, +focus on parameter tuning and cooling schedules..." +``` + +**Template Variation:** +```yaml +prompt: + template_dir: "custom_templates/" + use_template_stochasticity: true + system_message: | + # Use multiple greeting variations + [Randomly: "Let's optimize this code:" | "Time to enhance:" | "Improving:"] +``` + +
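
The variation mechanism is easy to picture: when a prompt is built, a random choice replaces each placeholder. The snippet below illustrates the idea only; it is not OpenEvolve's actual implementation.

```python
# Conceptual illustration of template stochasticity (not OpenEvolve internals).
import random

VARIATIONS = {"greeting": ["Let's optimize this code:", "Time to enhance:", "Improving:"]}
TEMPLATE = "{greeting}\n\n{code}"

def render(code: str, rng: random.Random) -> str:
    """Fill each placeholder with a randomly chosen variation."""
    return TEMPLATE.format(greeting=rng.choice(VARIATIONS["greeting"]), code=code)

print(render("def f(x): return x * x", random.Random(42)))
```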

### Meta-Evolution: Using OpenEvolve to Optimize Prompts

**You can use OpenEvolve to evolve your system messages themselves!**

```yaml
# Example: Evolve prompts for HotpotQA dataset
Initial Prompt: "Answer the question based on the context."

Evolved Prompt: "As an expert analyst, carefully examine the provided context.
Break down complex multi-hop reasoning into clear steps. Cross-reference
information from multiple sources to ensure accuracy. Answer: [question]"

Result: +23% accuracy improvement on HotpotQA benchmark
```

See the [LLM Prompt Optimization example](examples/llm_prompt_optimization/) for a complete implementation.

### Common Pitfalls to Avoid

- **Too Vague**: "Make the code better" → Specify exactly what "better" means
- **Too Restrictive**: Over-constraining can prevent useful optimizations
- **Missing Context**: Include relevant domain knowledge and terminology
- **No Examples**: Concrete examples guide the LLM better than abstract descriptions
- **Ignoring Artifacts**: Use error feedback from artifacts to refine your prompts

## 🔧 Artifacts & Debugging

**Artifacts side-channel** provides rich feedback to accelerate evolution:

From 508ffdaa2a3f373ca83bf4fcc61a9d67116f6057 Mon Sep 17 00:00:00 2001
From: Asankhaya Sharma
Date: Wed, 17 Sep 2025 11:01:19 +0800
Subject: [PATCH 2/6] Update README.md

---
 README.md | 13 +++++++++++++
 1 file changed, 13 insertions(+)

diff --git a/README.md b/README.md
index 3aaea0ef7..f218a650c 100644
--- a/README.md
+++ b/README.md
@@ -537,23 +537,27 @@ Based on successful OpenEvolve implementations, system messages are best created
🔄 Step-by-Step Process

**Phase 1: Initial Draft**
+
1. Start with a basic system message describing your goal
2. Run 20-50 evolution iterations to observe behavior
3. Note where the system gets "stuck" or makes poor choices

**Phase 2: Refinement**
+
4. Add specific guidance based on observed issues
5. Include domain-specific terminology and concepts
6. Define clear constraints and optimization targets
7. Run another batch of iterations

**Phase 3: Specialization**
+
8. Add detailed examples of good vs bad approaches
9. Include specific library/framework guidance
10. Add error avoidance patterns you've observed
11. Fine-tune based on artifact feedback

**Phase 4: Optimization**
+
12. Consider using OpenEvolve itself to optimize your prompt
13. Measure improvements using combined score metrics

@@ -626,6 +630,7 @@ prompt:
🎨 Prompt Engineering Patterns

**Structure Your Message:**
+
- Start with role definition ("You are an expert...")
- Define the specific task and context
- List optimization opportunities with examples
- Set clear constraints and safety rules
- End with success criteria

**Use Specific Examples:**
+
```yaml
# Good: Specific optimization targets
system_message: |
  Focus on reducing memory allocations in the hot loop.
  Example: Replace `new Vector()` with pre-allocated arrays.

# Avoid: Vague guidance
system_message: "Make the code faster"
```

**Include Domain Knowledge:**
+
```yaml
# Good: Domain-specific guidance
system_message: |
  For GPU kernels, prioritize:
  1. Memory coalescing (access patterns)
  2. Occupancy (thread utilization)
  3. Shared memory usage (cache blocking)

# Avoid: Generic optimization advice
system_message: "Optimize the algorithm"
```

**Set Clear Boundaries:**
+
```yaml
system_message: |
  MUST NOT CHANGE:
  ❌ Function signatures
  ❌ Algorithm correctness
  ❌ External API compatibility

  ALLOWED TO OPTIMIZE:
  ✅ Internal implementation details
  ✅ Data structures and algorithms
  ✅ Performance optimizations
```

@@ -676,11 +684,13 @@
🔬 Advanced Techniques

**Artifact-Driven Iteration:**
+
- Enable artifacts in your config
- Include common error patterns in system message
- Add guidance based on stderr/warning patterns

**Multi-Phase Evolution:**
+
```yaml
# Phase 1: Broad exploration
system_message: "Explore different algorithmic approaches..."

# Phase 2: Focused optimization
system_message: "Given the successful simulated annealing approach,
focus on parameter tuning and cooling schedules..."
``` **Template Variation:** + ```yaml prompt: template_dir: "custom_templates/" @@ -747,6 +758,7 @@ return EvaluationResult( ``` **Next generation prompt automatically includes:** + ```markdown ## Previous Execution Feedback โš ๏ธ Warning: suboptimal memory access pattern @@ -772,6 +784,7 @@ python scripts/visualizer.py --path examples/function_minimization/openevolve_ou ``` **Features:** + - ๐ŸŒณ **Evolution tree** with parent-child relationships - ๐Ÿ“ˆ **Performance tracking** across generations - ๐Ÿ” **Code diff viewer** showing mutations From 367bb5634ea3e2948b2305720b1283d574eb2727 Mon Sep 17 00:00:00 2001 From: Asankhaya Sharma Date: Wed, 17 Sep 2025 11:03:51 +0800 Subject: [PATCH 3/6] Update README.md --- README.md | 17 +++++++++++++---- 1 file changed, 13 insertions(+), 4 deletions(-) diff --git a/README.md b/README.md index f218a650c..610b3bac9 100644 --- a/README.md +++ b/README.md @@ -510,8 +510,14 @@ prompt: - "Let's enhance this code:" - "Time to optimize:" - "Improving the algorithm:" + improvement_suggestion: + - "Here's how we could improve this code:" + - "I suggest the following improvements:" + - "We can enhance this code by:" ``` +**How it works:** Place `{greeting}` or `{improvement_suggestion}` placeholders in your templates, and OpenEvolve will randomly choose from the variations for each generation, adding diversity to prompts. + See [prompt examples](examples/llm_prompt_optimization/templates/) for complete template customization. @@ -700,15 +706,18 @@ system_message: "Given the successful simulated annealing approach, focus on parameter tuning and cooling schedules..." ``` -**Template Variation:** +**Template Stochasticity:** ```yaml prompt: template_dir: "custom_templates/" use_template_stochasticity: true - system_message: | - # Use multiple greeting variations - [Randomly: "Let's optimize this code:" | "Time to enhance:" | "Improving:"] + template_variations: + greeting: + - "Let's optimize this code:" + - "Time to enhance:" + - "Improving:" + # Then use {greeting} in your templates to get random variations ``` From 165a77ae778fa8eba3996c06c36933d5d869e573 Mon Sep 17 00:00:00 2001 From: Asankhaya Sharma Date: Wed, 17 Sep 2025 11:14:10 +0800 Subject: [PATCH 4/6] Update README.md --- README.md | 149 +++++++++++++++++------------------------------------- 1 file changed, 45 insertions(+), 104 deletions(-) diff --git a/README.md b/README.md index 610b3bac9..5e9b239bf 100644 --- a/README.md +++ b/README.md @@ -64,7 +64,7 @@ Full reproducibility, extensive evaluation pipelines, and scientific rigor built | ๐ŸŽฏ **Domain** | ๐Ÿ“ˆ **Achievement** | ๐Ÿ”— **Example** | |---------------|-------------------|----------------| -| **GPU Optimization** | 2-3x speedup on Apple Silicon | [MLX Metal Kernels](examples/mlx_metal_kernel_opt/) | +| **GPU Optimization** | Hardware-optimized kernel discovery | [MLX Metal Kernels](examples/mlx_metal_kernel_opt/) | | **Mathematical** | State-of-the-art circle packing (n=26) | [Circle Packing](examples/circle_packing/) | | **Algorithm Design** | Adaptive sorting algorithms | [Rust Adaptive Sort](examples/rust_adaptive_sort/) | | **Scientific Computing** | Automated filter design | [Signal Processing](examples/signal_processing/) | @@ -127,12 +127,7 @@ result = evolve_function( print(f"Evolved sorting algorithm: {result.best_code}") ``` -**Prefer Docker?** -```bash -docker run --rm -v $(pwd):/app ghcr.io/codelion/openevolve:latest \ - examples/function_minimization/initial_program.py \ - examples/function_minimization/evaluator.py 
--iterations 100 -``` +**Prefer Docker?** See the [Installation & Setup](#-installation--setup) section for Docker options. ## ๐ŸŽฌ See It In Action @@ -207,10 +202,10 @@ OpenEvolve implements a sophisticated **evolutionary coding pipeline** that goes
๐Ÿค– Advanced LLM Integration -- **Test-Time Compute**: Integration with [OptiLLM](https://github.com/codelion/optillm) for MoA and enhanced reasoning -- **Universal API**: Works with OpenAI, Google, local models -- **Plugin Ecosystem**: Support for OptiLLM plugins (readurls, executecode, z3_solver) +- **Universal API**: Works with OpenAI, Google, local models, and proxies - **Intelligent Ensembles**: Weighted combinations with sophisticated fallback +- **Test-Time Compute**: Enhanced reasoning through proxy systems (see [OptiLLM setup](#llm-provider-setup)) +- **Plugin Ecosystem**: Support for advanced reasoning plugins
@@ -267,11 +262,34 @@ pip install -e ".[dev]" ๐Ÿณ Docker ```bash +# Pull the image docker pull ghcr.io/codelion/openevolve:latest + +# Run an example +docker run --rm -v $(pwd):/app ghcr.io/codelion/openevolve:latest \ + examples/function_minimization/initial_program.py \ + examples/function_minimization/evaluator.py --iterations 100 ``` +### Cost Estimation + +**Cost depends on your LLM provider and iterations:** + +- **o3**: ~$0.15-0.60 per iteration (depending on code size) +- **o3-mini**: ~$0.03-0.12 per iteration (more cost-effective) +- **Gemini-2.5-Pro**: ~$0.08-0.30 per iteration +- **Gemini-2.5-Flash**: ~$0.01-0.05 per iteration (fastest and cheapest) +- **Local models**: Nearly free after setup +- **OptiLLM**: Use cheaper models with test-time compute for better results + +**Cost-saving tips:** +- Start with fewer iterations (100-200) +- Use o3-mini, Gemini-2.5-Flash or local models for exploration +- Use cascade evaluation to filter bad programs early +- Configure smaller population sizes initially + ### LLM Provider Setup OpenEvolve works with **any OpenAI-compatible API**: @@ -347,7 +365,7 @@ llm: | Project | Domain | Achievement | Demo | |---------|--------|-------------|------| | [๐ŸŽฏ **Function Minimization**](examples/function_minimization/) | Optimization | Random โ†’ Simulated Annealing | [View Results](examples/function_minimization/openevolve_output/) | -| [โšก **MLX GPU Kernels**](examples/mlx_metal_kernel_opt/) | Hardware | 2-3x Apple Silicon speedup | [Benchmarks](examples/mlx_metal_kernel_opt/README.md) | +| [โšก **MLX GPU Kernels**](examples/mlx_metal_kernel_opt/) | Hardware | Apple Silicon optimization | [Benchmarks](examples/mlx_metal_kernel_opt/README.md) | | [๐Ÿ”„ **Rust Adaptive Sort**](examples/rust_adaptive_sort/) | Algorithms | Data-aware sorting | [Code Evolution](examples/rust_adaptive_sort/) | | [๐Ÿ“ **Symbolic Regression**](examples/symbolic_regression/) | Science | Automated equation discovery | [LLM-SRBench](examples/symbolic_regression/) | | [๐Ÿ•ธ๏ธ **Web Scraper + OptiLLM**](examples/web_scraper_optillm/) | AI Integration | Test-time compute optimization | [Smart Scraping](examples/web_scraper_optillm/) | @@ -442,11 +460,11 @@ max_iterations: 1000 random_seed: 42 # Full reproducibility llm: - # Ensemble with test-time compute + # Ensemble configuration models: - name: "gemini-2.5-pro" weight: 0.6 - - name: "moa&readurls-o3" # OptiLLM features + - name: "gemini-2.5-flash" weight: 0.4 temperature: 0.7 @@ -635,53 +653,25 @@ prompt:
๐ŸŽจ Prompt Engineering Patterns -**Structure Your Message:** - -- Start with role definition ("You are an expert...") -- Define the specific task and context -- List optimization opportunities with examples -- Set clear constraints and safety rules -- End with success criteria +**Structure Your Message:** Start with role definition โ†’ Define task/context โ†’ List optimization opportunities โ†’ Set constraints โ†’ Success criteria **Use Specific Examples:** - ```yaml -# Good: Specific optimization targets -system_message: | - Focus on reducing memory allocations in the hot loop. - Example: Replace `new Vector()` with pre-allocated arrays. - -# Avoid: Vague guidance -system_message: "Make the code faster" +# Good: "Focus on reducing memory allocations. Example: Replace `new Vector()` with pre-allocated arrays." +# Avoid: "Make the code faster" ``` **Include Domain Knowledge:** - ```yaml -# Good: Domain-specific guidance -system_message: | - For GPU kernels, prioritize: - 1. Memory coalescing (access patterns) - 2. Occupancy (thread utilization) - 3. Shared memory usage (cache blocking) - -# Avoid: Generic optimization advice -system_message: "Optimize the algorithm" +# Good: "For GPU kernels: 1) Memory coalescing 2) Occupancy 3) Shared memory usage" +# Avoid: "Optimize the algorithm" ``` **Set Clear Boundaries:** - ```yaml system_message: | - MUST NOT CHANGE: - โŒ Function signatures - โŒ Algorithm correctness - โŒ External API compatibility - - ALLOWED TO OPTIMIZE: - โœ… Internal implementation details - โœ… Data structures and algorithms - โœ… Performance optimizations + MUST NOT CHANGE: โŒ Function signatures โŒ Algorithm correctness โŒ External API + ALLOWED: โœ… Internal implementation โœ… Data structures โœ… Performance optimizations ```
@@ -689,55 +679,19 @@ system_message: |
๐Ÿ”ฌ Advanced Techniques -**Artifact-Driven Iteration:** - -- Enable artifacts in your config -- Include common error patterns in system message -- Add guidance based on stderr/warning patterns +**Artifact-Driven Iteration:** Enable artifacts in config โ†’ Include common error patterns in system message โ†’ Add guidance based on stderr/warning patterns -**Multi-Phase Evolution:** +**Multi-Phase Evolution:** Start broad ("Explore different algorithmic approaches"), then focus ("Given successful simulated annealing, focus on parameter tuning") -```yaml -# Phase 1: Broad exploration -system_message: "Explore different algorithmic approaches..." - -# Phase 2: Focused optimization -system_message: "Given the successful simulated annealing approach, -focus on parameter tuning and cooling schedules..." -``` - -**Template Stochasticity:** - -```yaml -prompt: - template_dir: "custom_templates/" - use_template_stochasticity: true - template_variations: - greeting: - - "Let's optimize this code:" - - "Time to enhance:" - - "Improving:" - # Then use {greeting} in your templates to get random variations -``` +**Template Stochasticity:** See the [Configuration section](#-configuration) for complete template variation examples.
### Meta-Evolution: Using OpenEvolve to Optimize Prompts -**You can use OpenEvolve to evolve your system messages themselves!** - -```yaml -# Example: Evolve prompts for HotpotQA dataset -Initial Prompt: "Answer the question based on the context." - -Evolved Prompt: "As an expert analyst, carefully examine the provided context. -Break down complex multi-hop reasoning into clear steps. Cross-reference -information from multiple sources to ensure accuracy. Answer: [question]" - -Result: +23% accuracy improvement on HotpotQA benchmark -``` +**You can use OpenEvolve to evolve your system messages themselves!** This powerful technique lets you optimize prompts for better LLM performance automatically. -See the [LLM Prompt Optimization example](examples/llm_prompt_optimization/) for a complete implementation. +See the [LLM Prompt Optimization example](examples/llm_prompt_optimization/) for a complete implementation, including the HotpotQA case study with +23% accuracy improvement. ### Common Pitfalls to Avoid @@ -825,20 +779,7 @@ Want to contribute? Check out our [roadmap discussions](https://github.com/codel
๐Ÿ’ฐ How much does it cost to run? -**Cost depends on your LLM provider and iterations:** - -- **o3**: ~$0.15-0.60 per iteration (depending on code size) -- **o3-mini**: ~$0.03-0.12 per iteration (more cost-effective) -- **Gemini-2.5-Pro**: ~$0.08-0.30 per iteration -- **Gemini-2.5-Flash**: ~$0.01-0.05 per iteration (fastest and cheapest) -- **Local models**: Nearly free after setup -- **OptiLLM**: Use cheaper models with test-time compute for better results - -**Cost-saving tips:** -- Start with fewer iterations (100-200) -- Use o3-mini, Gemini-2.5-Flash or local models for exploration -- Use cascade evaluation to filter bad programs early -- Configure smaller population sizes initially +See the [Cost Estimation](#cost-estimation) section in Installation & Setup for detailed pricing information and cost-saving tips.
@@ -929,7 +870,7 @@ We welcome contributions! Here's how to get started: **Articles & Blog Posts About OpenEvolve**: - [Towards Open Evolutionary Agents](https://huggingface.co/blog/driaforall/towards-open-evolutionary-agents) - Evolution of coding agents and the open-source movement -- [OpenEvolve: GPU Kernel Discovery](https://huggingface.co/blog/codelion/openevolve-gpu-kernel-discovery) - Automated discovery of optimized GPU kernels with 2-3x speedups +- [OpenEvolve: GPU Kernel Discovery](https://huggingface.co/blog/codelion/openevolve-gpu-kernel-discovery) - Automated discovery of optimized GPU kernels - [OpenEvolve: Evolutionary Coding with LLMs](https://huggingface.co/blog/codelion/openevolve) - Introduction to evolutionary algorithm discovery using large language models ## ๐Ÿ“Š Citation From 6f501254355ec3c99552737ec27eb14de74256bf Mon Sep 17 00:00:00 2001 From: Asankhaya Sharma Date: Wed, 17 Sep 2025 11:24:02 +0800 Subject: [PATCH 5/6] Update README.md --- README.md | 85 ++++++++++++++++++++++++------------------------------- 1 file changed, 37 insertions(+), 48 deletions(-) diff --git a/README.md b/README.md index 5e9b239bf..070103747 100644 --- a/README.md +++ b/README.md @@ -15,7 +15,7 @@ License

-[๐Ÿš€ **Quick Start**](#-quick-start) โ€ข [๐Ÿ“– **Examples**](#-examples-gallery) โ€ข [๐Ÿ“ **System Messages**](#-crafting-effective-system-messages) โ€ข [๐Ÿ’ฌ **Discussions**](https://github.com/codelion/openevolve/discussions) +[๐Ÿš€ **Quick Start**](#quick-start) โ€ข [**Examples**](#examples-gallery) โ€ข [**System Messages**](#crafting-effective-system-messages) โ€ข [**Discussions**](https://github.com/codelion/openevolve/discussions) *From random search to state-of-the-art: Watch your code evolve in real-time* @@ -23,25 +23,25 @@ --- -## โœจ Why OpenEvolve? +## Why OpenEvolve? @@ -58,11 +58,11 @@ Full reproducibility, extensive evaluation pipelines, and scientific rigor built | **Multi-objective** | Complex tradeoffs | Automatic Pareto optimization | | **Scaling** | Doesn't scale | Parallel evolution across islands | -## ๐Ÿ† Proven Achievements +## Proven Achievements
-| ๐ŸŽฏ **Domain** | ๐Ÿ“ˆ **Achievement** | ๐Ÿ”— **Example** | +| **Domain** | **Achievement** | **Example** | |---------------|-------------------|----------------| | **GPU Optimization** | Hardware-optimized kernel discovery | [MLX Metal Kernels](examples/mlx_metal_kernel_opt/) | | **Mathematical** | State-of-the-art circle packing (n=26) | [Circle Packing](examples/circle_packing/) | @@ -127,12 +127,12 @@ result = evolve_function( print(f"Evolved sorting algorithm: {result.best_code}") ``` -**Prefer Docker?** See the [Installation & Setup](#-installation--setup) section for Docker options. +**Prefer Docker?** See the [Installation & Setup](#installation--setup) section for Docker options. -## ๐ŸŽฌ See It In Action +## See It In Action
-๐Ÿ”ฅ Circle Packing: From Random to State-of-the-Art +Circle Packing: From Random to State-of-the-Art **Watch OpenEvolve discover optimal circle packing in real-time:** @@ -146,7 +146,7 @@ print(f"Evolved sorting algorithm: {result.best_code}")
-โšก GPU Kernel Evolution +GPU Kernel Evolution **Before (Baseline)**: ```metal @@ -174,23 +174,23 @@ kernel void attention_evolved(/* ... */) {
-## ๐Ÿงฌ How OpenEvolve Works +## How OpenEvolve Works OpenEvolve implements a sophisticated **evolutionary coding pipeline** that goes far beyond simple optimization: ![OpenEvolve Architecture](openevolve-architecture.png) -### ๐ŸŽฏ **Core Innovation**: MAP-Elites + LLMs +### **Core Innovation**: MAP-Elites + LLMs - **Quality-Diversity Evolution**: Maintains diverse populations across feature dimensions - **Island-Based Architecture**: Multiple populations prevent premature convergence - **LLM Ensemble**: Multiple models with intelligent fallback strategies - **Artifact Side-Channel**: Error feedback improves subsequent generations -### ๐Ÿš€ **Advanced Features** +### **Advanced Features**
-๐Ÿ”ฌ Scientific Reproducibility +Scientific Reproducibility - **Comprehensive Seeding**: Every component (LLM, database, evaluation) is seeded - **Default Seed=42**: Immediate reproducible results out of the box @@ -200,7 +200,7 @@ OpenEvolve implements a sophisticated **evolutionary coding pipeline** that goes
-๐Ÿค– Advanced LLM Integration +Advanced LLM Integration - **Universal API**: Works with OpenAI, Google, local models, and proxies - **Intelligent Ensembles**: Weighted combinations with sophisticated fallback @@ -210,7 +210,7 @@ OpenEvolve implements a sophisticated **evolutionary coding pipeline** that goes
-๐Ÿงฌ Evolution Algorithm Innovations +Evolution Algorithm Innovations - **Double Selection**: Different programs for performance vs inspiration - **Adaptive Feature Dimensions**: Custom quality-diversity metrics @@ -219,15 +219,15 @@ OpenEvolve implements a sophisticated **evolutionary coding pipeline** that goes
-## ๐ŸŽฏ Perfect For +## Perfect For | **Use Case** | **Why OpenEvolve Excels** | |--------------|---------------------------| -| ๐Ÿƒโ€โ™‚๏ธ **Performance Optimization** | Discovers hardware-specific optimizations humans miss | -| ๐Ÿงฎ **Algorithm Discovery** | Finds novel approaches to classic problems | -| ๐Ÿ”ฌ **Scientific Computing** | Automates tedious manual tuning processes | -| ๐ŸŽฎ **Competitive Programming** | Generates multiple solution strategies | -| ๐Ÿ“Š **Multi-Objective Problems** | Pareto-optimal solutions across dimensions | +| **Performance Optimization** | Discovers hardware-specific optimizations humans miss | +| **Algorithm Discovery** | Finds novel approaches to classic problems | +| **Scientific Computing** | Automates tedious manual tuning processes | +| **Competitive Programming** | Generates multiple solution strategies | +| **Multi-Objective Problems** | Pareto-optimal solutions across dimensions | ## ๐Ÿ›  Installation & Setup @@ -356,23 +356,23 @@ llm: -## ๐Ÿ“ธ Examples Gallery +## Examples Gallery
-### ๐Ÿ† **Showcase Projects** +### **Showcase Projects** | Project | Domain | Achievement | Demo | |---------|--------|-------------|------| -| [๐ŸŽฏ **Function Minimization**](examples/function_minimization/) | Optimization | Random โ†’ Simulated Annealing | [View Results](examples/function_minimization/openevolve_output/) | -| [โšก **MLX GPU Kernels**](examples/mlx_metal_kernel_opt/) | Hardware | Apple Silicon optimization | [Benchmarks](examples/mlx_metal_kernel_opt/README.md) | -| [๐Ÿ”„ **Rust Adaptive Sort**](examples/rust_adaptive_sort/) | Algorithms | Data-aware sorting | [Code Evolution](examples/rust_adaptive_sort/) | -| [๐Ÿ“ **Symbolic Regression**](examples/symbolic_regression/) | Science | Automated equation discovery | [LLM-SRBench](examples/symbolic_regression/) | -| [๐Ÿ•ธ๏ธ **Web Scraper + OptiLLM**](examples/web_scraper_optillm/) | AI Integration | Test-time compute optimization | [Smart Scraping](examples/web_scraper_optillm/) | +| [**Function Minimization**](examples/function_minimization/) | Optimization | Random โ†’ Simulated Annealing | [View Results](examples/function_minimization/openevolve_output/) | +| [**MLX GPU Kernels**](examples/mlx_metal_kernel_opt/) | Hardware | Apple Silicon optimization | [Benchmarks](examples/mlx_metal_kernel_opt/README.md) | +| [**Rust Adaptive Sort**](examples/rust_adaptive_sort/) | Algorithms | Data-aware sorting | [Code Evolution](examples/rust_adaptive_sort/) | +| [**Symbolic Regression**](examples/symbolic_regression/) | Science | Automated equation discovery | [LLM-SRBench](examples/symbolic_regression/) | +| [**Web Scraper + OptiLLM**](examples/web_scraper_optillm/) | AI Integration | Test-time compute optimization | [Smart Scraping](examples/web_scraper_optillm/) |
-### ๐ŸŽฏ **Quick Example**: Function Minimization +### **Quick Example**: Function Minimization **Watch OpenEvolve evolve from random search to sophisticated optimization:** @@ -388,7 +388,7 @@ def minimize_function(func, bounds, max_evals=1000): return best_x, best_val ``` -**โ†“ Evolution Process โ†“** +**Evolution Process** ```python # Evolved Program (Simulated Annealing + Adaptive Cooling) @@ -413,20 +413,9 @@ def minimize_function(func, bounds, max_evals=1000): ### ๐Ÿ”ฌ **Advanced Examples**
-๐ŸŽจ Prompt Evolution +Prompt Evolution -**Evolve prompts instead of code** for better LLM performance: - -```yaml -# Example: HotpotQA dataset -Initial Prompt: "Answer the question based on the context." - -Evolved Prompt: "As an expert analyst, carefully examine the provided context. -Break down complex multi-hop reasoning into clear steps. Cross-reference -information from multiple sources to ensure accuracy. Answer: [question]" - -Result: +23% accuracy improvement on HotpotQA benchmark -``` +**Evolve prompts instead of code** for better LLM performance. See the [LLM Prompt Optimization example](examples/llm_prompt_optimization/) for a complete case study with HotpotQA achieving +23% accuracy improvement. [Full Example](examples/llm_prompt_optimization/) @@ -683,7 +672,7 @@ system_message: | **Multi-Phase Evolution:** Start broad ("Explore different algorithmic approaches"), then focus ("Given successful simulated annealing, focus on parameter tuning") -**Template Stochasticity:** See the [Configuration section](#-configuration) for complete template variation examples. +**Template Stochasticity:** See the [Configuration section](#configuration) for complete template variation examples.
@@ -892,8 +881,8 @@ If you use OpenEvolve in your research, please cite: ### **๐Ÿš€ Ready to evolve your code?** -**Made with โค๏ธ by the OpenEvolve community** +**Maintained by the OpenEvolve community** -*Star โญ this repository if OpenEvolve helps you discover breakthrough algorithms!* +*If OpenEvolve helps you discover breakthrough algorithms, please consider starring this repository.*
From 5fc9fd41818ec6410d7165a9347ec8b6fb9378b6 Mon Sep 17 00:00:00 2001 From: Asankhaya Sharma Date: Wed, 17 Sep 2025 11:27:20 +0800 Subject: [PATCH 6/6] Update README.md --- README.md | 28 ++++++++++++++-------------- 1 file changed, 14 insertions(+), 14 deletions(-) diff --git a/README.md b/README.md index 070103747..b72f8470e 100644 --- a/README.md +++ b/README.md @@ -93,7 +93,7 @@ python openevolve-run.py examples/function_minimization/initial_program.py \ **Note:** The example config uses Gemini by default, but you can use any OpenAI-compatible provider by modifying the `config.yaml`. See the [configs](configs/) for full configuration options. -### ๐Ÿ“š **Library Usage** +### **Library Usage** OpenEvolve can be used as a library without any external files: @@ -410,7 +410,7 @@ def minimize_function(func, bounds, max_evals=1000): **Performance**: 100x improvement in convergence speed! -### ๐Ÿ”ฌ **Advanced Examples** +### **Advanced Examples**
Prompt Evolution @@ -439,7 +439,7 @@ def minimize_function(func, bounds, max_evals=1000):

-## ⚙️ Configuration
+## Configuration

OpenEvolve offers extensive configuration for advanced users:

@@ -529,7 +529,7 @@ See [prompt examples](examples/llm_prompt_optimization/templates/) for complete

-## 📝 Crafting Effective System Messages
+## Crafting Effective System Messages

**System messages are the secret to successful evolution.** They guide the LLM's understanding of your domain, constraints, and optimization goals. A well-crafted system message can be the difference between random mutations and targeted improvements.

@@ -578,7 +578,7 @@ Based on successful OpenEvolve implementations, system messages are best created

### Examples by Complexity

-#### 🎯 **Simple: General Optimization**
+#### **Simple: General Optimization**
```yaml
prompt:
  system_message: |
    You are an expert programmer specializing in optimization algorithms.
    Your task is to improve a function minimization algorithm to find the
    global minimum reliably, escaping local minima that might trap simple algorithms.
```

-#### 🔧 **Intermediate: Domain-Specific Guidance**
+#### **Intermediate: Domain-Specific Guidance**
```yaml
prompt:
  system_message: |
    You are an expert prompt engineer. Your task is to revise prompts for LLMs.

@@ -690,7 +690,7 @@ See the [LLM Prompt Optimization example](examples/llm_prompt_optimization/) for
- **No Examples**: Concrete examples guide the LLM better than abstract descriptions
- **Ignoring Artifacts**: Use error feedback from artifacts to refine your prompts

-## 🔧 Artifacts & Debugging
+## Artifacts & Debugging

**Artifacts side-channel** provides rich feedback to accelerate evolution:

@@ -720,7 +720,7 @@ return EvaluationResult(
This creates a **feedback loop** where each generation learns from previous mistakes!

-## 📊 Visualization
+## Visualization

**Real-time evolution tracking** with interactive web interface:

@@ -745,7 +745,7 @@ python scripts/visualizer.py --path examples/function_minimization/openevolve_ou

![OpenEvolve Visualizer](openevolve-visualizer.png)

-## 🚀 Roadmap
+## Roadmap

### **🔥 Upcoming Features**

@@ -763,7 +763,7 @@ python scripts/visualizer.py --path examples/function_minimization/openevolve_ou

Want to contribute? Check out our [roadmap discussions](https://github.com/codelion/openevolve/discussions/categories/roadmap)!

-## 🤔 FAQ
+## FAQ

<details>
๐Ÿ’ฐ How much does it cost to run? @@ -834,7 +834,7 @@ Just set the `api_base` in your config to point to your endpoint.
-### ๐ŸŒŸ **Contributors** +### **Contributors** Thanks to all our amazing contributors who make OpenEvolve possible! @@ -842,7 +842,7 @@ Thanks to all our amazing contributors who make OpenEvolve possible! -### ๐Ÿค **Contributing** +### **Contributing** We welcome contributions! Here's how to get started: @@ -855,14 +855,14 @@ We welcome contributions! Here's how to get started: **New to open source?** Check out our [Contributing Guide](CONTRIBUTING.md) and look for [`good-first-issue`](https://github.com/codelion/openevolve/issues?q=is%3Aissue+is%3Aopen+label%3A%22good+first+issue%22) labels! -### ๐Ÿ“š **Academic & Research** +### **Academic & Research** **Articles & Blog Posts About OpenEvolve**: - [Towards Open Evolutionary Agents](https://huggingface.co/blog/driaforall/towards-open-evolutionary-agents) - Evolution of coding agents and the open-source movement - [OpenEvolve: GPU Kernel Discovery](https://huggingface.co/blog/codelion/openevolve-gpu-kernel-discovery) - Automated discovery of optimized GPU kernels - [OpenEvolve: Evolutionary Coding with LLMs](https://huggingface.co/blog/codelion/openevolve) - Introduction to evolutionary algorithm discovery using large language models -## ๐Ÿ“Š Citation +## Citation If you use OpenEvolve in your research, please cite:

-### 🎯 **Autonomous Discovery**
+### **Autonomous Discovery**
LLMs don't just optimize—they **discover** entirely new algorithms. No human guidance needed.

-### ⚡ **Proven Results**
+### **Proven Results**
-**2-3x speedups** on real hardware. **State-of-the-art** circle packing. **Breakthrough** optimizations.
+**Real speedups** on real hardware. **State-of-the-art** circle packing. **Breakthrough** optimizations.

-### 🔬 **Research Grade**
+### **Research Grade**
Full reproducibility, extensive evaluation pipelines, and scientific rigor built-in.