
Robots Exclusion Standard data for archives.business

Resource Scan

Scan Details

Site Domain archives.business
Base Domain archives.business
Scan Status Ok
Last Scan 2025-12-03T14:08:19+00:00
Next Scan 2025-12-10T14:08:19+00:00

Last Scan

Scanned 2025-12-03T14:08:19+00:00
URL https://archives.business/robots.txt
Domain IPs 104.21.94.153, 172.67.137.200, 2606:4700:3035::ac43:89c8, 2606:4700:3036::6815:5e99
Response IP 172.67.137.200
Found Yes
Hash 69b9c5c44b254a3139c59f439080931602337305bd9f03293557498bf7e9fea6
SimHash 691d0352f233

Groups

cliqzbot

Rule Path
Disallow /

getintent

Rule Path
Disallow /

proximic

Rule Path
Disallow /

ltx71

Rule Path
Disallow /

jamesbot

Rule Path
Disallow /

smtbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

*

Rule Path
Disallow /search?
Disallow /edit/
Disallow /cdn-cgi/
Disallow /dynjs/
Disallow /dyn/actions/
Disallow /en/search?
Disallow /fr/search?
Allow /
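
The groups above can be read as a small decision procedure. The following is a minimal sketch, assuming the longest-match precedence described in RFC 9309 (the most specific matching path wins, and Allow wins a tie). The rule data is copied from the groups listed here; the function name `is_allowed` and the agent name `examplebot` are hypothetical and not part of the scan. It expects the crawler's product token rather than a full User-Agent header.

```python
# Rules copied from the "*" group above. The matching logic is an
# illustrative sketch of longest-match precedence, not the scanner's own code.
WILDCARD_RULES = [
    ("disallow", "/search?"),
    ("disallow", "/edit/"),
    ("disallow", "/cdn-cgi/"),
    ("disallow", "/dynjs/"),
    ("disallow", "/dyn/actions/"),
    ("disallow", "/en/search?"),
    ("disallow", "/fr/search?"),
    ("allow", "/"),
]

# Agents that have their own group with a blanket "Disallow /".
BLOCKED_AGENTS = {
    "cliqzbot", "getintent", "proximic", "ltx71",
    "jamesbot", "smtbot", "bytespider",
}


def is_allowed(user_agent: str, path: str) -> bool:
    """Return True if `path` (including any query string) may be fetched."""
    # A named group takes precedence over the "*" group for that agent.
    if user_agent.lower() in BLOCKED_AGENTS:
        return False

    best_kind, best_len = "allow", -1  # no matching rule means allowed
    for kind, prefix in WILDCARD_RULES:
        if not path.startswith(prefix):
            continue
        # The longer prefix wins; on equal length, Allow is preferred.
        longer = len(prefix) > best_len
        tie_for_allow = len(prefix) == best_len and kind == "allow"
        if longer or tie_for_allow:
            best_kind, best_len = kind, len(prefix)
    return best_kind == "allow"
```

Under this reading, `is_allowed("examplebot", "/search?q=x")` is rejected because `/search?` is a longer match than `/`, while `is_allowed("examplebot", "/about")` falls through to `Allow /`.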

Other Records

Field Value
sitemap https://archives.business/sitemaps/sitemap_index.xml
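
The sitemap record can also be read programmatically. The sketch below uses `urllib.robotparser` from the Python standard library (`site_maps()` requires Python 3.8 or later). `ROBOTS_LINES` is an abbreviated, assumed reconstruction of the file for illustration, not the fetched body, and `sitemap_urls` is a hypothetical helper name.

```python
from urllib.robotparser import RobotFileParser

# Assumed, abbreviated reconstruction of the file; only the sitemap
# record matters for this example.
ROBOTS_LINES = [
    "User-agent: *",
    "Allow: /",
    "Sitemap: https://archives.business/sitemaps/sitemap_index.xml",
]


def sitemap_urls(lines):
    """Return the Sitemap URLs declared in robots.txt lines (empty if none)."""
    parser = RobotFileParser()
    parser.parse(lines)
    # site_maps() returns None when no Sitemap record is present.
    return parser.site_maps() or []
```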