iamhiphopcracy.com
robots.txt

Robots Exclusion Standard data for iamhiphopcracy.com

Resource Scan

Scan Details

Site Domain iamhiphopcracy.com
Base Domain iamhiphopcracy.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2024-09-17T15:29:49+00:00
Next Scan 2024-11-16T15:29:49+00:00

Last Successful Scan

Scanned2024-06-27T15:05:21+00:00
URL https://iamhiphopcracy.com/robots.txt
Domain IPs 192.0.78.24, 192.0.78.25
Response IP 192.0.78.25
Found Yes
Hash 7e48185a439d1d8cf113eb5b816114552608a73ea12d3172bd6cf8e040aaa7db
SimHash b05318c2a240

Groups

*

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

sentibot

Rule Path
Disallow /

sentibot

Rule Path
Disallow /

Comments

  • This file was generated on Fri, 05 Apr 2024 03:26:19 +0000