custompc.com
robots.txt

Robots Exclusion Standard data for custompc.com

Resource Scan

Scan Details

Site Domain custompc.com
Base Domain custompc.com
Scan Status Ok
Last Scan2024-11-13T22:42:35+00:00
Next Scan 2024-11-20T22:42:35+00:00

Last Scan

Scanned2024-11-13T22:42:35+00:00
URL https://custompc.com/robots.txt
Redirect https://www.custompc.com/robots.txt
Redirect Domain www.custompc.com
Redirect Base custompc.com
Domain IPs 104.21.22.214, 172.67.207.30, 2606:4700:3034::6815:16d6, 2606:4700:3035::ac43:cf1e
Redirect IPs 104.21.22.214, 172.67.207.30, 2606:4700:3034::6815:16d6, 2606:4700:3035::ac43:cf1e
Response IP 172.67.207.30
Found Yes
Hash 5a5b30ad24e3878741adc9e73e0194c5ac80a34bb3263954d58b9fe8aafac27d
SimHash 431dca528aba

Groups

*

Rule Path
Allow /wp-content/uploads/
Allow /wp-admin/admin-ajax.php
Disallow /wp-admin/
Disallow /widgets/
Disallow /page/
Disallow /page
Disallow /constellationwidget/
Disallow /mediaapi/
Disallow /wp_cron.php
Disallow /xmlrpc.php
Disallow /.well-known/amphtml/apikey.pub
Disallow /search/*
Disallow /tag/*
Disallow /user/*
Disallow /profile/*
Disallow /taxonomy/*
Disallow /filter/*
Disallow /custom-home
Disallow /wp-json/*
Disallow /profiles/*
Disallow /?*
Disallow /*?*

ahrefsbot
semrushbot
dotbot
mauibot
mj12bot
claudebot

Rule Path
Disallow /

facebookbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

Other Records

Field Value
sitemap https://www.custompc.com/sitemap.xml
sitemap https://www.custompc.com/googlenews.xml