@@ -202,12 +202,12 @@ while not done:
202
202
203
203
#### Datasets:
204
204
205
- | Environment Name | Datasets |
206
- | :-----------------------:| :-------------------:|
207
- | ` d4rl:maze2d-open-v0 ` | ` 1k, 10k, 100k, 1m ` |
208
- | ` d4rl:maze2d-medium-v1 ` | ` 1k, 10k, 100k, 1m ` |
209
- | ` d4rl:maze2d-umaze-v1 ` | ` 1k, 10k, 100k, 1m ` |
210
- | ` d4rl:maze2d-large-v1 ` | ` 1k, 10k, 100k, 1m ` |
205
+ | Environment Name | Datasets | Query-Count |
206
+ | :-----------------------:| :-------------------:| :-----------------: |
207
+ | ` d4rl:maze2d-open-v0 ` | ` 1k, 10k, 100k, 1m ` | ` 1500 ` |
208
+ | ` d4rl:maze2d-medium-v1 ` | ` 1k, 10k, 100k, 1m ` | ` 1500 ` |
209
+ | ` d4rl:maze2d-umaze-v1 ` | ` 1k, 10k, 100k, 1m ` | ` 1500 ` |
210
+ | ` d4rl:maze2d-large-v1 ` | ` 1k, 10k, 100k, 1m ` | ` 121 ` < sup >< strong > * </ strong ></ sup > |
211
211
212
212
#### Pre-trained policy performance:
213
213
@@ -228,11 +228,11 @@ while not done:
228
228
229
229
#### Datasets:
230
230
231
- | Environment Name | Datasets |
232
- | :----------------:| :------------------------------------------------------:|
233
- | ` HalfCheetah-v2 ` | ` random, expert, medium, medium-replay, medium-expert ` |
234
- | ` Hopper-v2 ` | ` random, expert, medium, medium-replay, medium-expert ` |
235
- | ` Walker2d-v2 ` | ` random, expert, medium, medium-replay, medium-expert ` |
231
+ | Environment Name | Datasets | Query-Count |
232
+ | :----------------:| :------------------------------------------------------:| :-----------------: |
233
+ | ` HalfCheetah-v2 ` | ` random, expert, medium, medium-replay, medium-expert ` | ` 1500 ` |
234
+ | ` Hopper-v2 ` | ` random, expert, medium, medium-replay, medium-expert ` | ` 1500 ` |
235
+ | ` Walker2d-v2 ` | ` random, expert, medium, medium-replay, medium-expert ` | ` 1500 ` |
236
236
237
237
#### Pre-trained Policy performance:
238
238
0 commit comments