Robots Exclusion Standard data for wardayacollege.com

Resource Scan

Scan Details

Site Domain wardayacollege.com
Base Domain wardayacollege.com
Scan Status Ok
Last Scan 2024-11-13T23:42:01+00:00
Next Scan 2024-11-20T23:42:01+00:00

Last Scan

Scanned 2024-11-13T23:42:01+00:00
URL https://wardayacollege.com/robots.txt
Redirect https://www.wardayacollege.com/robots.txt
Redirect Domain www.wardayacollege.com
Redirect Base wardayacollege.com
Domain IPs 104.21.45.78, 172.67.211.59, 2606:4700:3034::ac43:d33b, 2606:4700:3035::6815:2d4e
Redirect IPs 104.21.45.78, 172.67.211.59, 2606:4700:3034::ac43:d33b, 2606:4700:3035::6815:2d4e
Response IP 104.21.45.78
Found Yes
Hash 1f9e5a3eb819309d2da7df85f946ecf21570de9af16b3d2fd1a9f3183a87a865
SimHash 65b1535d47c9
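
The report does not say how the Hash value is computed. Its 64 hexadecimal digits are consistent with a SHA-256 digest, so the sketch below assumes SHA-256 over the raw response body. The function name `robots_fingerprint` is hypothetical, and the exact bytes the scanner hashes (for example, before or after newline normalization) are unknown.

```python
import hashlib


def robots_fingerprint(body: bytes) -> str:
    """Return a hex digest of a robots.txt body.

    Assumption: the scanner's "Hash" field is SHA-256 of the raw bytes.
    The report itself does not confirm the algorithm.
    """
    return hashlib.sha256(body).hexdigest()


if __name__ == "__main__":
    sample = b"User-agent: *\nDisallow: /test\n"
    digest = robots_fingerprint(sample)
    # A SHA-256 hex digest is always 64 characters, like the Hash field above.
    print(len(digest), digest)
```

Comparing the digest from two scans is a cheap way to detect that the file changed at all. A fuzzy fingerprint such as the SimHash field serves a different purpose, estimating how similar two versions are.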

Groups

*

Rule Path
Disallow /kimia2
Disallow /wp-content/plugins/_site-functionality/js/excluded
Disallow /wp-content/themes/praktismedia/js/excluded
Disallow /wp-content/cache/min/1/wp-content/plugins/_site-functionality/js/excluded
Disallow /wp-content/cache/min/1/wp-content/themes/praktismedia/js/excluded
Disallow /test
Disallow /draft
Disallow /wp-admin/
Disallow /wp-login/
Disallow /xmlrpc.php
Disallow /cgi-bin/
Disallow *%26preview%3D
Disallow */comments
Disallow */feed
Disallow */search
Disallow */trackback
Disallow *%26p%3D
Allow */amp$
Allow */?amp$
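
The rules above mix plain path prefixes with the `*` and `$` wildcards. A minimal sketch of how a crawler might evaluate them follows, assuming the widely used convention that `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and Allow wins a tie. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical, only a subset of the listed rules is included, and percent-encoding normalization is left out.

```python
import re

# A subset of the rules listed in the "*" group above.
RULES = [
    ("disallow", "/kimia2"),
    ("disallow", "/test"),
    ("disallow", "/wp-admin/"),
    ("disallow", "*/feed"),
    ("disallow", "*/search"),
    ("allow", "*/amp$"),
    ("allow", "*/?amp$"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    "*" becomes ".*"; a trailing "$" anchors the end of the path;
    every other character (including "?") is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if the path may be crawled under the given rules."""
    best = None  # (pattern length, is_allow) of the strongest match so far
    for kind, pattern in rules:
        # re.match anchors at the start, which gives prefix semantics.
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            # Longer pattern wins; on equal length, allow (True) wins.
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]


if __name__ == "__main__":
    for p in ["/about", "/test-page", "/blog/feed", "/blog/post/amp", "/search/amp"]:
        print(p, is_allowed(p))
```

Under these assumptions `/test-page` is blocked, because `Disallow /test` is a prefix match, and `/search/amp` is blocked because `*/search` is a longer pattern than `*/amp$`.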

Comments

  • Allow Search Engine to index all the site
  • Exclude Non-Primary JS
  • Disallow /test /draft
  • Disallow: /arti-kata
  • Disallow: /arti
  • Disallow wp- & preview
  • Disallow: /wp-includes/
  • Disallow feed, trackback, query string
  • Disallow: *?
  • Allow amp
  • To put as comment when Disallow everything
  • Disallow everything
  • Disallow: /