savefromnets.com
robots.txt

Robots Exclusion Standard data for savefromnets.com

Resource Scan

Scan Details

Site Domain savefromnets.com
Base Domain savefromnets.com
Scan Status Ok
Last Scan2024-11-10T19:54:11+00:00
Next Scan 2024-11-17T19:54:11+00:00

Last Scan

Scanned2024-11-10T19:54:11+00:00
URL https://savefromnets.com/robots.txt
Domain IPs 104.21.22.70, 172.67.203.84, 2606:4700:3030::6815:1646, 2606:4700:3030::ac43:cb54
Response IP 104.21.22.70
Found Yes
Hash 23cdfe9f8cdefdaa36b1a3a4be631e56d81bf787ba7e07f18e4f3eeb4f5def70
SimHash 041cce20e790

Groups

*

Rule Path
Disallow /

googlebot

Rule Path
Disallow

bingbot

Rule Path
Disallow

duckduckbot

Rule Path
Disallow

baiduspider

Rule Path
Disallow

yandex

Rule Path
Disallow

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

sogou

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

Comments

  • Block all user-agents by default
  • Allow major search engines
  • Disallow common bad bots
  • Specify the preferred host

Warnings

  • `host` is not a known field.