Skip to content

Commit e62d971

Browse files
authored
Merge branch 'main' into trackio
2 parents 377db24 + 41a7e08 commit e62d971

17 files changed

+1258
-12
lines changed

_blog.yml

Lines changed: 48 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -6340,6 +6340,7 @@
63406340
- models
63416341
- open-source
63426342

6343+
63436344
- local: migrating-the-hub-to-xet
63446345
title: "Migrating the Hub from Git LFS to Xet"
63456346
author: jsulz
@@ -6397,6 +6398,7 @@
63976398
- local: consilium-multi-llm
63986399
title: "Consilium: When Multiple LLMs Collaborate"
63996400
author: azettl
6401+
guest: true
64006402
thumbnail: /blog/assets/consilium-multi-llm/thumbnail.png
64016403
date: Jul 17, 2025
64026404
tags:
@@ -6413,3 +6415,49 @@
64136415
tags:
64146416
- collaboration
64156417
- guide
6418+
6419+
- local: lora-fast
6420+
title: "Fast LoRA inference for Flux with Diffusers and PEFT"
6421+
thumbnail: /blog/assets/lora-fast/thumbnail.png
6422+
author: sayakpaul
6423+
date: July 23, 2025
6424+
tags:
6425+
- lora
6426+
- diffusion
6427+
- guide
6428+
6429+
- local: timescope-video-lmm-benchmark
6430+
title: "TimeScope: How Long Can Your Video Large Multimodal Model Go?"
6431+
author: orrzohar
6432+
thumbnail: /blog/assets/timescope/thumbnail.png
6433+
date: Jul 23, 2025
6434+
tags:
6435+
- video
6436+
- datasets
6437+
- multimodal
6438+
- open-source
6439+
- benchmark
6440+
6441+
- local: parquet-cdc
6442+
title: "Parquet Content-Defined Chunking"
6443+
author: kszucs
6444+
thumbnail: /blog/assets/parquet-cdc/thumbnail.png
6445+
date: July 25, 2025
6446+
tags:
6447+
- data
6448+
- datasets
6449+
- dedupe
6450+
- hub
6451+
- parquet
6452+
- storage
6453+
- xet
6454+
6455+
- local: hf-cli
6456+
title: "Say hello to `hf`: a faster, friendlier Hugging Face CLI ✨"
6457+
author: Wauplin
6458+
thumbnail: /blog/assets/hf-cli-thumbnail.png
6459+
date: Jul 25, 2025
6460+
tags:
6461+
- cli
6462+
- huggingface_hub
6463+
- python

assets/hf-cli-thumbnail.png

108 KB
Loading

assets/lora-fast/thumbnail.png

55.5 KB
Loading

assets/parquet-cdc/thumbnail.png

349 KB
Loading

assets/timescope/thumbnail.png

2 MB
Loading

consilium-multi-llm.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -3,6 +3,7 @@ title: "Consilium: When Multiple LLMs Collaborate"
33
thumbnail: /blog/assets/consilium-multi-llm/thumbnail.png
44
authors:
55
- user: azettl
6+
guest: true
67
---
78

89
# Consilium: When Multiple LLMs Collaborate

designing-positional-encoding.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -406,7 +406,7 @@ of values from our input vector. For \\(2D\\) data, we need to encode both horiz
406406

407407
## The future of positional encoding
408408

409-
Is RoPE the final incarnation of positional encoding? This [recent paper](https://arxiv.org/pdf/2410.06205) from DeepMind deeply analyses RoPE and highlights some fundamental problems. TLDR: RoPE isn't a perfect solution, and the models mostly focus on the lower frequencies and the rotation for a certain percent of low frequencies improves performance on Gemma 2B!
409+
Is RoPE the final incarnation of positional encoding? This [recent paper](https://arxiv.org/pdf/2410.06205) from DeepMind deeply analyses RoPE and highlights some fundamental problems. TLDR: RoPE isn't a perfect solution, and the models mostly focus on the lower frequencies, but the paper shows that **removing** (not rotating) the lowest frequencies improves performance on Gemma 2B!
410410

411411
I anticipate some future breakthroughs, perhaps taking inspiration from
412412
signal processing with ideas like wavelets or hierarchical implementations. As models

ettin.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -807,4 +807,4 @@ The complete Ettin suite includes models at six different scales (for both encod
807807
- [📝 Paper](https://github.com/jhu-clsp/ettin-encoder-vs-decoder)
808808
- [🗂️ Training Data](https://huggingface.co/datasets/jhu-clsp/ettin-pretraining-data) (2T+ tokens, fully open)
809809
- [💻 GitHub Repository](https://github.com/jhu-clsp/ettin-encoder-vs-decoder)
810-
- [📊 250+ Training Checkpoints](https://huggingface.co/datasets/jhu-clsp/ettin-checkpoints) for studying training dynamics or knowledge learning
810+
- [📊 250+ Training Checkpoints](https://huggingface.co/jhu-clsp/ettin-checkpoints) for studying training dynamics or knowledge learning

futurebench.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -40,7 +40,7 @@ Forecasting makes **contamination impossible by design**, as you can't train on
4040

4141
Perhaps most importantly, predictions about the future are **inherently verifiable**. We can wait and see who was right, creating an objective, time-stamped measure of model performance.
4242

43-
We therefore propose evaluating agents on their ability to predict future events (Ye et al., 2024; Karger et al., 2025). **FutureBench** draws from real-world prediction markets and emerging news to create interesting prediction tasks grounded in actual future outcomes. We collect events from platforms and live news coverages and manifolds markets, filtering them to focus on emerging events worth predicting. Using an agent-based approach, we curate scenarios that require genuine reasoning rather than simple pattern matching. Think geopolitical developments, market movements, or technology adoption trends - events where informed analysis actually matters.
43+
We therefore propose evaluating agents on their ability to predict future events (Ye et al., 2024; Karger et al., 2025). **FutureBench** draws from real-world prediction markets and emerging news to create interesting prediction tasks grounded in actual future outcomes. We collect events from platforms and live news coverages and manifold markets, filtering them to focus on emerging events worth predicting. Using an agent-based approach, we curate scenarios that require genuine reasoning rather than simple pattern matching. Think geopolitical developments, market movements, or technology adoption trends - events where informed analysis actually matters.
4444

4545
## Can Agents Predict Future Events?
4646
This is the obvious question, and it's at the heart of what makes this benchmark interesting! We believe the answer cannot be a simple “yes” or a “no”, as it mostly depends on the actual questions; there are always important caveats to consider.

gradio-mcp-updates.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -37,7 +37,7 @@ As an MCP developer, it's highly recommended to implement your MCP tools to emit
3737

3838
## Transform OpenAPI Specs to MCP in One Line
3939

40-
If you want to integrate an existing backend API into an LLM, you have to manually map API endpoints to MCP tools. This can be a time consuming and error prone chore. With this release, Gradio can automate the entire process for you! With a single line of code, you can integrate your business backend into any MCP-compatible LLM.
40+
If you want to integrate an existing backend API into an LLM, you have to manually map API endpoints to MCP tools. This can be a time-consuming and error prone chore. With this release, Gradio can automate the entire process for you! With a single line of code, you can integrate your business backend into any MCP-compatible LLM.
4141

4242
[OpenAPI](https://www.openapis.org/) is a widely adopted standard for describing RESTful APIs in a machine-readable format, typically as a JSON file. Gradio now features the `gr.load_openapi` function, which creates a Gradio application directly from an OpenAPI schema. You can then launch the app with `mcp_server=True` to automatically create an MCP server for your API!
4343

@@ -96,7 +96,7 @@ You can read more about this in the Gradio [Guides](https://www.gradio.app/guide
9696

9797
## Modifying Tool Descriptions
9898

99-
Gradio automatically generates tool descriptions from your function names and docstrings. Now you can customize the tool description even further with the `api_description` parmeter. In this example, the tool description will read "Apply a sepia filter to any image."
99+
Gradio automatically generates tool descriptions from your function names and docstrings. Now you can customize the tool description even further with the `api_description` parameter. In this example, the tool description will read "Apply a sepia filter to any image."
100100

101101
```python
102102
import gradio as gr
@@ -129,4 +129,4 @@ Read more in the [guide](https://www.gradio.app/guides/building-mcp-server-with-
129129

130130
## Conclusion
131131

132-
Want us to add a new MCP-related feature to Gradio? Let us know in the comments of the blog or on [GitHub](https://github.com/gradio-app/gradio/issues). Also if you've built a cool MCP server or Gradio app let us know in the comments and we'll amplify it!
132+
Want us to add a new MCP-related feature to Gradio? Let us know in the comments of the blog or on [GitHub](https://github.com/gradio-app/gradio/issues). Also if you've built a cool MCP server or Gradio app let us know in the comments and we'll amplify it!

0 commit comments

Comments
 (0)