news.allcrimea.net
robots.txt

Robots Exclusion Standard data for news.allcrimea.net

Resource Scan

Scan Details

Site Domain news.allcrimea.net
Base Domain allcrimea.net
Scan Status Ok
Last Scan2024-10-31T21:17:02+00:00
Next Scan 2024-11-30T21:17:02+00:00

Last Scan

Scanned2024-10-31T21:17:02+00:00
URL https://news.allcrimea.net/robots.txt
Domain IPs 104.21.55.208, 172.67.172.203, 2606:4700:3036::ac43:accb, 2606:4700:3037::6815:37d0
Response IP 104.21.55.208
Found Yes
Hash d1dc6efd9a4c29b3d472b592beba8049d1312185af8c8b63dc8b0ca0ac62b137
SimHash 955551056d1a

Groups

semrushbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

serpstatbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

pinterestbot

Rule Path
Disallow /

criteobot/0.1

Rule Path
Disallow /

proximic

Rule Path
Disallow /

Warnings

  • 1 invalid line.
  • `ahrefsbotdisallow` is not a known field.