buptjz.github.io
robots.txt

Robots Exclusion Standard data for buptjz.github.io

Resource Scan

Scan Details

Site Domain buptjz.github.io
Base Domain buptjz.github.io
Scan Status Ok
Last Scan 2024-08-30T04:39:04+00:00
Next Scan 2024-09-29T04:39:04+00:00

Last Scan

Scanned 2024-08-30T04:39:04+00:00
URL https://buptjz.github.io/robots.txt
Domain IPs 185.199.108.153, 185.199.109.153, 185.199.110.153, 185.199.111.153, 2606:50c0:8000::153, 2606:50c0:8001::153, 2606:50c0:8002::153, 2606:50c0:8003::153
Response IP 185.199.110.153
Found Yes
Hash 77a9ddf80d30ad22d5fdae92a32c943dc9da177160e67a23ede30ab95df1b4aa
SimHash b28d298d6550

Groups

*

Rule Path
Disallow /test/
Disallow /old/
Disallow /rmit/
Disallow /bin/
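
As a rough illustration of how a crawler might apply the group above, the following sketch feeds the four Disallow rules to Python's standard urllib.robotparser. The rule lines are reconstructed from this listing rather than fetched from the live file, and the helper name is_allowed and the user agent "ExampleBot" are hypothetical, chosen only for the example.

```python
from urllib.robotparser import RobotFileParser

# Rules reconstructed from the "*" group in this listing (an assumption:
# the live robots.txt may contain more lines than are shown here).
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /test/",
    "Disallow: /old/",
    "Disallow: /rmit/",
    "Disallow: /bin/",
]

BASE_URL = "https://buptjz.github.io"


def is_allowed(path, user_agent="ExampleBot"):
    """Return True if the rules permit user_agent to fetch path.

    Hypothetical helper for illustration; "ExampleBot" is a made-up agent.
    """
    parser = RobotFileParser()
    parser.parse(ROBOTS_LINES)  # parse in-memory lines, no network access
    return parser.can_fetch(user_agent, BASE_URL + path)


if __name__ == "__main__":
    for path in ("/index.html", "/test/page.html", "/old/", "/bin/tool"):
        print(path, is_allowed(path))
```

Because each rule ends with a slash, it matches by prefix only paths inside that directory; a path such as /test (without the trailing slash) is not covered by Disallow /test/ under this parser.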

Comments

  • See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines: