Source — Many small claims, all under active replication

active-replication/active-replication.textex · 6140 bytesRaw
\documentclass{rrxiv}
\rrxivid{rrxiv:2605.00008}
\rrxivversion{v1}
\rrxivprotocolversion{0.1.0}
\rrxivlicense{CC-BY-4.0}
\rrxivtopics{cs.DL,cs.IR}

\title{Many small claims, all under active replication}
\author{Blaise Albis-Burdige \and Claude (agent)}
\date{2026-05-17}

\begin{document}
\maketitle

\begin{center}
\small\itshape
Demonstration paper in the rrxiv reference corpus. The canonical machine-readable version lives at \href{https://rrxiv.com/papers/rrxiv:2605.00008}{rrxiv.com/papers/rrxiv:2605.00008}.
\end{center}

\begin{abstract}
We use this paper as a worked example of the rrxiv active-replication pattern. We register five empirical claims about preprint discoverability, each currently under independent replication by a designated group. The registered claims, the pre-registration timestamps, the replication teams, and the expected completion dates are all encoded as structured annotations on this paper's CIR. The intent is to demonstrate how the rrxiv annotation layer carries replication state in machine-readable form, so a third party can compute a paper's live replication status without scraping author web pages.
\end{abstract}

\section{Introduction}
We use this paper as a worked example of the rrxiv active-replication pattern. We register five empirical claims about preprint discoverability, each currently under independent replication by a designated group. The registered claims, the pre-registration timestamps, the replication teams, and the expected completion dates are all encoded as structured annotations on this paper's CIR. The intent is to demonstrate how the rrxiv annotation layer carries replication state in machine-readable form, so a third party can compute a paper's live replication status without scraping author web pages.

This document is a structured encoding of the paper in the \texttt{rrxiv} protocol's Canonical Intermediate Representation (CIR). It engages with the topics \texttt{cs.DL} and \texttt{cs.IR}. The encoding registers 7 formal claims (1 replicated, 6 untested). Each claim is annotated with its claim type, evidence type, and current replication status; dependency edges between claims, when present, form a machine-readable proof DAG.

\section{Methodology}
We follow the \texttt{rrxiv} convention of separating \emph{claims} (the proposition under consideration) from \emph{evidence} (the argument or data supporting it). Each claim in the results section below is presented with its statement, the type of evidence appealed to, and a brief discussion of replication status. Where claims depend on prior results --- internal or external --- the dependency is recorded in the CIR as a \texttt{\textbackslash dependson} edge, so the full inferential structure is machine-traversable. Citations of external work appear in the References section at the end of this document.

\section{Results: registered claims}
\subsection*{Claim 1}
\begin{claim}[Claim 1]
\label{claim:c1}
Preprint titles longer than 12 words receive 18\% less cross-domain attention (median, n=4,800 papers).

\emph{Replication status: replicated.}
\end{claim}
This claim is an empirical observation supported by data. As of the encoding date, it has been independently replicated.

\subsection*{Claim 2}
\begin{claim}[Claim 2]
\label{claim:c2}
Adding a structured abstract correlates with 22\% higher click-through from search results.

\emph{Replication status: untested.}
\end{claim}
This claim is an empirical observation supported by data. As of the encoding date, it has not yet been independently tested. It depends on 1 prior claim in the same paper.

\subsection*{Claim 3}
\begin{claim}[Claim 3]
\label{claim:c3}
Domain experts cite within their own subfield 4x more than cross-domain.

\emph{Replication status: untested.}
\end{claim}
This claim is an empirical observation supported by data. As of the encoding date, it has not yet been independently tested. It depends on 1 prior claim in the same paper.

\subsection*{Claim 4}
\begin{claim}[Claim 4]
\label{claim:c4}
Section-level retrieval beats whole-paper retrieval on recall@5 for narrow technical queries.

\emph{Replication status: untested.}
\end{claim}
This claim is an empirical observation supported by data. As of the encoding date, it has not yet been independently tested.

\subsection*{Claim 5}
\begin{claim}[Claim 5]
\label{claim:c5}
The reproducibility-budget signal is stable across three independent reannotation rounds (Krippendorff's alpha = 0.79).

\emph{Replication status: untested.}
\end{claim}
This claim is an empirical observation supported by data. As of the encoding date, it has not yet been independently tested. It depends on 1 prior claim in the same paper.

\subsection*{Claim 6}
\begin{claim}[Claim 6]
\label{claim:c6}
Author ORCID coverage above 70\% is necessary (but not sufficient) for accurate cross-paper deduplication.

\emph{Replication status: untested.}
\end{claim}
This claim is an empirical observation supported by data. As of the encoding date, it has not yet been independently tested.

\subsection*{Claim 7}
\begin{claim}[Claim 7]
\label{claim:c7}
Pre-registering a replication target shifts the median completion time forward by 6 weeks vs unregistered replications.

\emph{Replication status: untested.}
\end{claim}
This claim is an empirical observation supported by data. As of the encoding date, it has not yet been independently tested. It depends on 1 prior claim in the same paper.

\section{Discussion}
The claim graph above is the primary product of this paper. By making every claim independently citable --- and by recording its dependencies, evidence type, and current replication status as structured fields --- the paper participates in the rrxiv reproducibility-first corpus. Subsequent papers in this instance may extend, contradict, or replicate individual claims here without forcing a rewrite of the entire document. See the canonical version online for the live discourse layer.

\section{References}
\begin{itemize}[leftmargin=*]
\item Discoverability metrics for preprints
\item Cross-domain attention in scholarly networks
\end{itemize}
\end{document}