DPO Dataset Annotation Interface

A web-based tool for annotating Direct Preference Optimization (DPO) datasets. This interface allows you to compare pairs of model responses and mark preferences for training preference-based models.

Features

Fast annotation workflow: Click a response to immediately save and advance
Auto-save functionality: Progress is automatically saved to browser storage
Resume capability: Continue from where you left off
Smart navigation: Automatically skips already annotated items
Markdown rendering: Properly displays formatted text, code blocks, and lists
Company name highlighting: Automatically highlights advertiser names in responses
Progress tracking: Visual progress bar showing completion status
Keyboard shortcuts: Speed up annotation with hotkeys

Getting Started

Save the HTML file to your computer
Open it in a web browser
Load your dataset using the "Load Dataset" button
Start annotating!

Dataset Format

The tool expects a JSON file with an array of objects. Each object must have the following structure:

Required Fields

{
  "prompt": "The instruction/prompt given to the model",
  "response_1": "First model response to compare",
  "response_2": "Second model response to compare", 
  "preference": "",
  "metadata": {
    "question": "Original question asked",
    "original_answer": "Original response without advertisements",
    "response_1_ad_metadata": {
      "top_products": [
        {
          "name": "Product Name",
          "description": "Product description",
          "category": "Product category"
        }
      ]
    },
    "response_2_ad_metadata": {
      "top_products": [
        {
          "name": "Product Name", 
          "description": "Product description",
          "category": "Product category"
        }
      ]
    }
  }
}

Field Descriptions

Field	Type	Description	Required
`prompt`	string	The instruction/prompt given to the model	✅ Yes
`response_1`	string	First response option for comparison	✅ Yes
`response_2`	string	Second response option for comparison	✅ Yes
`preference`	string	Annotation result (filled by the tool)	✅ Yes
`metadata`	object	Additional information about the responses	✅ Yes
`metadata.question`	string	Original question asked	✅ Yes
`metadata.original_answer`	string	Original response without ads	✅ Yes
`metadata.response_X_ad_metadata`	object	Advertisement metadata	❌ Optional
`metadata.response_X_ad_metadata.top_products`	array	Product information for highlighting	❌ Optional

Preference Values

The preference field will be automatically filled with one of these values:

"response_1" - Response 1 was preferred
"response_2" - Response 2 was preferred
"both_good" - Both responses are acceptable
"both_bad" - Both responses are poor quality

Example Dataset

[
  {
    "prompt": "Explain what a .ckpt file is in machine learning",
    "response_1": "A .ckpt file stores model state...",
    "response_2": "A .ckpt file, used in ML, contains...",
    "preference": "",
    "metadata": {
      "question": "What is a .ckpt file for machine learning?",
      "original_answer": "A .ckpt file stores the current state...",
      "response_1_ad_metadata": {
        "top_products": [
          {
            "name": "TensorFlow",
            "description": "Open source ML platform",
            "category": "Software/Tools"
          }
        ]
      },
      "response_2_ad_metadata": {
        "top_products": [
          {
            "name": "PyTorch", 
            "description": "Deep learning framework",
            "category": "Software/Tools"
          }
        ]
      }
    }
  }
]

Usage Instructions

Loading Data

Click "Load Dataset"
Select your JSON file
The tool will automatically resume from previous progress if available

Annotation Controls

Mouse Controls:

Click Response A or B to prefer that response
Click "Both Good" if both responses are acceptable
Click "Both Bad" if both responses are poor quality

Keyboard Shortcuts:

1 - Select Response A
2 - Select Response B
G - Mark both as good
B - Mark both as bad
←/→ - Navigate between items

Navigation Options

Forward: Annotate from beginning to end
Backward: Annotate from end to beginning
Use Previous/Next buttons for manual navigation

Auto-Save Features

Progress is saved automatically after each annotation
Data is backed up to browser storage every 30 seconds
Progress is preserved if you close the browser accidentally

Downloading Results

Click "Download Results" to save your annotated dataset
File will be saved as annotated_dpo_dataset.json
Original dataset remains unchanged

Data Privacy

All processing happens locally in your browser
No data is sent to external servers
Annotations are stored in browser's local storage
Original dataset files remain on your computer

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
README.md		README.md
index.html		index.html

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

DPO Dataset Annotation Interface

Features

Getting Started

Dataset Format

Required Fields

Field Descriptions

Preference Values

Example Dataset

Usage Instructions

Loading Data

Annotation Controls

Navigation Options

Auto-Save Features

Downloading Results

Data Privacy

About

Uh oh!

Releases

Packages

Uh oh!

Languages

HectorRguez/DPO_annotation_tool

Folders and files

Latest commit

History

Repository files navigation

DPO Dataset Annotation Interface

Features

Getting Started

Dataset Format

Required Fields

Field Descriptions

Preference Values

Example Dataset

Usage Instructions

Loading Data

Annotation Controls

Navigation Options

Auto-Save Features

Downloading Results

Data Privacy

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages