ncclondon.ac.uk
robots.txt

Robots Exclusion Standard data for ncclondon.ac.uk

Resource Scan

Scan Details

Site Domain ncclondon.ac.uk
Base Domain ncclondon.ac.uk
Scan Status Ok
Last Scan2024-10-01T19:08:41+00:00
Next Scan 2024-10-31T19:08:41+00:00

Last Scan

Scanned2024-10-01T19:08:41+00:00
URL https://www.ncclondon.ac.uk/robots.txt
Domain IPs 162.159.135.42
Response IP 162.159.135.42
Found Yes
Hash ec4053f10791d4725712bdd037f86e35e5595770a2b9419915a6fbc1e9ee92fb
SimHash 7a24cd5aa81a

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-content/plugins/
Disallow /wp-content/themes/
Disallow /wp-content/upgrade/
Disallow /wp-content/cache/
Disallow /wp-content/uploads/
Disallow /wp-json/
Disallow /trackback/
Disallow /xmlrpc.php
Disallow /?s=
Disallow /search/
Disallow /author/
Disallow /comments/
Disallow /course-search-results/
Disallow /search-results/
Disallow /wp-login.php
Disallow /wp-register.php
Disallow /*?*

Comments

  • Specific files
  • Prevent indexing of URL parameters