/.well-known/

Log In Sign Up

burnsidenews.com
robots.txt

Robots Exclusion Standard data for burnsidenews.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	burnsidenews.com
Base Domain	burnsidenews.com
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Request timed out.
Last Scan	2024-04-03T04:39:14+00:00
Next Scan	2024-07-02T04:39:14+00:00

Last Successful Scan

Scanned	2022-11-16T03:27:35+00:00
URL	http://burnsidenews.com/robots.txt
Redirect	https://www.saltwire.com/robots.txt
Redirect Domain	www.saltwire.com
Redirect Base	saltwire.com
Response IP	104.21.21.143, 172.67.199.30
Found	Yes
Hash	04d6759252dcc7b836e6d4fe0803b0133b0897e4b70d049b7980b5035cc23866
SimHash	bc149d1bc744

Groups

*

Rule

Path

Disallow

/admin/

Disallow

/api/

Disallow

/search/

Other Records

Field

Value

crawl-delay

10

Back to top

Other Records

Field

Value

sitemap

https://www.saltwire.com/media/sitemaps/sitemap.xml

Back to top

Comments

robots.txt
This file is to prevent the crawling and indexing of certain parts
of your site by web crawlers and spiders run by sites like Yahoo!
and Google. By telling these "robots" where not to go on your site,
you save bandwidth and server resources.
This file will be ignored unless it is at the root of your host:
Used: http://example.com/robots.txt
Ignored: http://example.com/site/robots.txt
For more information about the robots.txt standard, see:
http://www.robotstxt.org/
For syntax checking, see:
https://technicalseo.com/tools/robots-txt/
Directories

Back to top