gosocial.org.uk
robots.txt

Robots Exclusion Standard data for gosocial.org.uk

Resource Scan

Scan Details

Site Domain gosocial.org.uk
Base Domain gosocial.org.uk
Scan Status Ok
Last Scan2024-09-10T20:01:28+00:00
Next Scan 2024-10-10T20:01:28+00:00

Last Scan

Scanned2024-09-10T20:01:28+00:00
URL https://gosocial.org.uk/robots.txt
Redirect http://gosocial.org.uk/robots.txt
Domain IPs 109.203.100.122
Response IP 109.203.100.122
Found Yes
Hash 14e20e02638c62f594a50473505792c28651d60b3330c934cfce440c6e5ed0fd
SimHash 3c3d1d52c7d5

Groups

*

Rule Path
Disallow ow_version.xml
Disallow INSTALL.txt
Disallow LICENSE.txt
Disallow README.txt
Disallow UPDATE.txt
Disallow CHANGES.txt
Disallow /admin/

Other Records

Field Value
sitemap http://cdn.attracta.com/sitemap/3024418.xml.gz

Comments

  • This file contains rules to prevent the crawling and indexing of certain parts
  • of your web site by spiders of a major search engines likes Google and Yahoo.
  • By managing these rules you can allow or disallow access to specific folders
  • and files for such spyders.
  • The good way to hide private data or save a lot of bandwidth.
  • For more information about the robots.txt standard, see:
  • http://www.robotstxt.org/wc/robots.html
  • For syntax checking, see:
  • http://www.sxw.org.uk/computing/robots/check.html
  • Files
  • URLs
  • Begin Attracta SEO Tools Sitemap. Do not remove
  • End Attracta SEO Tools Sitemap. Do not remove