find-man.com
robots.txt

Robots Exclusion Standard data for find-man.com

Resource Scan

Scan Details

Site Domain find-man.com
Base Domain find-man.com
Scan Status Ok
Last Scan2025-11-04T07:13:27+00:00
Next Scan 2025-12-04T07:13:27+00:00

Last Scan

Scanned2025-11-04T07:13:27+00:00
URL https://find-man.com/robots.txt
Redirect https://www.find-man.com/robots.txt
Redirect Domain www.find-man.com
Redirect Base find-man.com
Domain IPs 104.21.36.135, 172.67.194.212
Redirect IPs 104.21.36.135, 172.67.194.212
Response IP 104.21.36.135
Found Yes
Hash 00c30b78fb797bf2d2bd19411ca4b7b289fa08e6176b45fed47666599fc186b1
SimHash 802547f06798

Groups

ahrefsbot
baiduspider
aport
ia_archiver
ia_archiver-web.archive.org
domaincrawler
lycos
mj12bot
grapeshot
proximic
scooter
amazonbot
slurp
webalta
weborama-fetcher
semrushbot
semrushbot-sa

Rule Path
Disallow /

mediapartners-google

Rule Path
Disallow /js/

*

Rule Path
Disallow /js/
Disallow /search/inn/?val=
Disallow /search/*%26page%3D
Disallow /arbitrage/*/page
Disallow /contract/*/page

Warnings

  • `host` is not a known field.