================================================================================
  The upscaler.video Codec Support Dataset
  Version: 2026-01-19
  License: CC-BY 4.0
================================================================================

OVERVIEW
--------
This dataset contains 71,334,706 individual WebCodecs API compatibility tests
collected from 224,360 real user sessions at free.upscaler.video.

Each row represents one codec string tested on one user's device, including
both encoder and decoder support data, along with the full browser user agent
string for detailed analysis.

DATASET FILE
------------
upscaler-video-codec-dataset-raw.csv (12.52 GB uncompressed)

CSV SCHEMA
----------
timestamp          - ISO 8601 timestamp of when test was performed
user_agent         - Full browser user agent string
browser            - Browser family (Chrome, Safari, Edge, Firefox, Unknown)
platform_raw       - Raw platform identifier from navigator.platform
platform           - Normalized platform (Windows, macOS, iOS, Android, Linux)
codec              - WebCodecs codec string tested (e.g., "av01.0.01M.08")
encoder_supported  - Whether VideoEncoder supports codec (true/false)
decoder_supported  - Whether VideoDecoder supports codec (true/false/empty)

EXAMPLE ROWS
------------
2026-01-05T00:54:11.570Z,"Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36...",Edge,Win32,Windows,av01.0.01M.08,true,
2026-01-16T23:58:08.560Z,"Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36...",Chrome,Win32,Windows,avc1.64040c,false,true

Note: decoder_supported is empty for sessions before Jan 14, 2026. The second
example shows a codec that can decode but not encode.

QUICK STATS
-----------
Total Tests:       71,334,706
Sessions:          224,360
Unique Codecs:     1,087
Collection Period: January 2026
Data Source:       free.upscaler.video
Encoder Data:      All 224,360 sessions
Decoder Data:      Sessions from Jan 14, 2026 onwards

PLATFORM NORMALIZATION
----------------------
The 'platform' column applies the following normalization:
  Win32, Win64 → Windows
  MacIntel → macOS
  iPhone, iPad → iOS
  Linux armv8*, Android* → Android
  Linux x86_64, Linux aarch64 → Linux

The 'platform_raw' column contains the original navigator.platform value.

USAGE EXAMPLES
--------------
Python (pandas):
  import pandas as pd
  df = pd.read_csv('upscaler-video-codec-dataset-raw.csv')

  # Calculate global encoder support per codec
  encoder_support = df.groupby('codec')['encoder_supported'].apply(
      lambda x: (x == 'true').mean() * 100
  )

  # Calculate decoder support (excluding empty values)
  df_with_decoder = df[df['decoder_supported'] != '']
  decoder_support = df_with_decoder.groupby('codec')['decoder_supported'].apply(
      lambda x: (x == 'true').mean() * 100
  )

R:
  library(readr)
  df <- read_csv('upscaler-video-codec-dataset-raw.csv')

  # Filter for specific browser
  chrome_only <- df[df$browser == 'Chrome',]

SQL (import to database):
  CREATE TABLE codec_tests (
    timestamp TIMESTAMP,
    user_agent TEXT,
    browser VARCHAR(50),
    platform_raw VARCHAR(50),
    platform VARCHAR(50),
    codec VARCHAR(100),
    encoder_supported BOOLEAN,
    decoder_supported VARCHAR(10)  -- 'true', 'false', or empty string
  );

CITATION
--------
If you use this dataset in research or publications, please cite:

  Bhattacharyya, Samrat. (2026). The upscaler.video Codec Support Dataset.
  Version 2026-01-19. https://free.upscaler.video/research/methodology/

BibTeX:
  @dataset{upscaler_codec_dataset_2026,
    title   = {The upscaler.video Codec Support Dataset},
    author  = {Bhattacharyya, Samrat},
    year    = {2026},
    version = {2026-01-19},
    url     = {https://free.upscaler.video/research/methodology/},
    note    = {71.3M codec tests from 224k sessions}
  }

LICENSE
-------
Creative Commons Attribution 4.0 International (CC-BY 4.0)

You are free to:
- Share and redistribute in any format
- Adapt, remix, and build upon the data
- Use commercially

Under the following terms:
- Attribution: Credit "upscaler.video Codec Support Dataset" with link

Full license: https://creativecommons.org/licenses/by/4.0/

MORE INFORMATION
----------------
Dataset Documentation:
  https://webcodecsfundamentals.org/datasets/codec-support/

Methodology Details:
  https://free.upscaler.video/research/methodology/

Codec Registry (interactive table):
  https://webcodecsfundamentals.org/datasets/codec-registry/

Questions or Issues:
  https://github.com/sb2702/webcodecs-fundamentals/issues

================================================================================
Dataset collected using free.upscaler.video
An open-source reference implementation of production WebCodecs workflows
================================================================================
