manualsrepo.com
robots.txt

Robots Exclusion Standard data for manualsrepo.com

Resource Scan

Scan Details

Site Domain manualsrepo.com
Base Domain manualsrepo.com
Scan Status Failed
Failure ReasonScan timed out.
Last Scan2025-08-08T05:01:24+00:00
Next Scan 2025-11-06T05:01:24+00:00

Last Successful Scan

Scanned2023-04-22T03:46:25+00:00
URL https://manualsrepo.com/robots.txt
Domain IPs 104.18.0.56, 104.18.1.56, 2606:4700::6812:138, 2606:4700::6812:38
Response IP 104.18.0.56
Found Yes
Hash 133e7d47789bf2a2730d81bebbe52586094d194b7ac5c21ff61e39ed49eadffa
SimHash d00dd7e15d12

Groups

*

Rule Path
Disallow /download/manuals/*
Disallow /search/*
Disallow /pdf/
Disallow /cdn-cgi/
Disallow /prefetching/

mediapartners-google

Rule Path
Allow /

sogou web spider

Rule Path
Disallow /

weborama-fetcher

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

addthis.com (http://support.addthis.com/)

Rule Path
Disallow /

proximic

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

grapeshot

Rule Path
Disallow /

proximic

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

seokicks

Rule Path
Disallow /

criteobot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

applebot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://manualsrepo.com/sitemap.xml