OARC 46

Name: OARC 46
Start: 2026-05-16T09:00:00+01:00
End: 2026-05-17T17:10:00+01:00
Location: Edinburgh International Conference Centre

16–17 May 2026 Workshop

Edinburgh International Conference Centre

Europe/London timezone

Contact

Estimation on the Root DITL Dataset

17 May 2026, 15:10

20m

Tinto and Moorfoot (Edinburgh International Conference Centre)

Tinto and Moorfoot

Edinburgh International Conference Centre

The Exchange Edinburgh EH3 8EE Scotland

In-Person Standard Presentation Main Session OARC 46 Day 2

Kazunori Fujiwara (Japan Registry Services Co., Ltd)

The DITL dataset serves as an invaluable resource for DNS research. The author gratefully acknowledges the data providers and DNS-OARC for permitting access to the Root DITL dataset. Because data collection methodologies vary significantly—with each Root Server Operator (RSO) capturing traffic to the best of their respective capabilities—it is essential to characterize the attributes of each dataset before analysis.

Despite this need, there is currently no standardized documentation regarding whether specific datasets are anonymized, the extent to which IP addresses are masked (e.g., prefix preservation), or whether the data represents partial or complete traffic logs. This presentation details an estimation of the DITL-2024 and 2025 dataset attributes:

Full Source IP Preservation: c (2024), g, k, and m-root datasets.

Partial Anonymization (Prefixes Preserved): a, b, d, f, h, and j-root datasets appear to mask source IPs but preserve /24 (IPv4) and /64 (IPv6) prefixes.

Full Anonymization (No Prefix Preservation): i and l (2024) root datasets.

Furthermore, by cross-referencing these datasets with RSSAC002 metrics for April 10, 2024, and April 9, 2025, I assessed data completeness. My findings suggest that the e-root dataset contains approximately 1% of total queries, the f-root dataset contains roughly one-third of the expected traffic, and the i-root dataset exhibits data gaps. Finally, as UDP checksums appear to be preserved in certain datasets, I attempted to reverse-engineer the original source IP addresses, with limited success in specific instances.

Talk duration	20 Minutes (+5 for Q&A)

Kazunori Fujiwara (Japan Registry Services Co., Ltd)

There are no materials yet.

OARC 46

Contact

Estimation on the Root DITL Dataset

Tinto and Moorfoot

Edinburgh International Conference Centre

Speaker

Description

Primary author

Presentation materials

Choose timezone

OARC 46

Contact

Speaker

Description

Primary author

Presentation materials