Deduplication Impact

What This Shows: Article counts by source before deduplication, showing which search engines contributed most to the raw results.

Why It Matters: Overlapping search results return the same article more than once, and counting those copies would skew the analysis. Deduplication removed 1,875 of the 8,202 raw articles (22.9%).
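
The removal rate can be rechecked from the before and after totals shown on this panel. A minimal sketch; the variable names are illustrative and not taken from the dashboard's own code:

```python
# Totals as reported on this panel.
raw_total = 8202      # articles before deduplication
unique_total = 6327   # articles after deduplication

# Duplicates removed, and their share of the raw results.
removed = raw_total - unique_total
removed_pct = round(100 * removed / raw_total, 1)

print(f"{removed:,} duplicates removed ({removed_pct}%)")
```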

How to Interpret: Bar length shows the raw article count for each source. The legend shows before -> after totals. When the same article appears more than once, the copy from an academic source is preferred as the one to keep.
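
The source-priority rule can be sketched as follows. This is an illustration under stated assumptions, not the pipeline behind this panel: matching on a normalized title, the ordering of News ahead of Other, and the names `SOURCE_PRIORITY`, `normalize_title`, and `deduplicate` are all hypothetical, and the sample articles are made up.

```python
from collections import Counter

# Assumed priority order: a lower number wins when copies collide.
# Only "Academic first" comes from the panel; the rest is a guess.
SOURCE_PRIORITY = {"Academic": 0, "News": 1, "Other": 2}

def normalize_title(title: str) -> str:
    """Hypothetical matching key: lowercase, alphanumerics only, single spaces."""
    cleaned = "".join(ch if ch.isalnum() else " " for ch in title.lower())
    return " ".join(cleaned.split())

def deduplicate(articles):
    """Keep one article per normalized title, preferring higher-priority sources."""
    kept = {}
    for article in articles:
        key = normalize_title(article["title"])
        current = kept.get(key)
        if current is None or (
            SOURCE_PRIORITY[article["source"]] < SOURCE_PRIORITY[current["source"]]
        ):
            kept[key] = article
    return list(kept.values())

# Made-up sample records for illustration only.
articles = [
    {"title": "Deep Learning for X-ray Triage", "source": "News"},
    {"title": "Deep learning for X-Ray triage.", "source": "Academic"},
    {"title": "City council approves budget", "source": "News"},
    {"title": "City Council Approves Budget", "source": "Other"},
]

raw_counts = Counter(a["source"] for a in articles)  # what the bars would show
unique = deduplicate(articles)                       # what survives deduplication
```

Here `raw_counts` corresponds to the per-source bar lengths, while `len(articles)` and `len(unique)` correspond to the before and after totals in the legend.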

Raw search results by source before duplicate removal
22.9% removed
1,875 duplicates
Legend: Academic, News, Other
8,202 -> 6,327 unique articles