newsite.com
robots.txt

Robots Exclusion Standard data for newsite.com

Resource Scan

Scanned	2021-09-30T10:28:28+00:00
URL	http://newsite.com/robots.txt
Redirect	https://newsiteinternet.mystrikingly.com/robots.txt
Redirect Domain	newsiteinternet.mystrikingly.com
Redirect Base	mystrikingly.com
Found	Yes
Hash	d069503de69e19ceb44e4c68f10a55c90374baae1208b569a5adb1cb7f395f5a
SimHash	b28d6f8d6440

Rule	Path
Disallow	/

Rule

Path

Disallow

/

Rule	Path
Disallow	/

Rule

Path

Disallow

/

Back to top

Field	Value
sitemap	https://newsiteinternet.mystrikingly.com/sitemap.xml

Field

Value

sitemap

https://newsiteinternet.mystrikingly.com/sitemap.xml

Back to top

See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file
To ban all spiders from the entire site uncomment the next two lines:
User-Agent: *
Disallow: /

Back to top