Uro
URL deduplicator and cleaner that removes duplicates and low-quality paths from URL lists
Recon & OSINT7 commands
#url#deduplicate#cleanup#python#recon
Installation
1Install via pip
2Verify installation
# Using pip
pip install uro
# Verify
uro --help
# Update
pip install --upgrade uroBasic Usage
Deduplicate and clean URL lists by removing duplicates, similar paths, and low-value entries
# Deduplicate URLs from stdin
cat urls.txt | uro
# Deduplicate and sort by path depth
cat urls.txt | uro --depth
# Only keep unique paths (ignore query params)
cat urls.txt | uro --unique
# Output to file
cat urls.txt | uro > clean-urls.txt
# Include subdomains
cat urls.txt | uro --subs
# Show statistics
cat urls.txt | uro --stats
# Keep URLs with specific extensions
cat urls.txt | uro --include php,asp,jsp
# Exclude specific extensions
cat urls.txt | uro --exclude png,jpg,cssCommand Reference
7 commands
1Sort by path depth
--depth2Keep only unique paths
--unique3Include subdomain URLs
--subs4Show deduplication statistics
--stats5Keep only URLs with these extensions
--include6Remove URLs with these extensions
--exclude7Group similar URLs
--similarWhen to Use
1
Cleaning up large URL lists before vulnerability scanning2
Removing duplicate entries from multiple recon sources3
Reducing noise by excluding static file extensions4
Preparing clean input for tools like gf or dalfox5
Analyzing URL dataset statistics and patternsNotes & Tips
1
Reduces URL list size significantly by removing duplicates2
Can filter by file extension to focus on dynamic pages3
Use --depth to prioritize shallow paths for manual review4
Works well in pipelines with gau, waybackurls, and other URL toolsCommon Errors & Solutions
Empty output
Check input format — each URL should be on a separate line
Too aggressive filtering
Remove --exclude or adjust include/exclude lists