bradystandard.com
robots.txt

Robots Exclusion Standard data for bradystandard.com

Resource Scan

Scan Details

Site Domain	bradystandard.com
Base Domain	bradystandard.com
Scan Status	Ok
Last Scan	2024-10-18T00:24:08+00:00
Next Scan	2024-11-17T00:24:08+00:00

Last Scan

Scanned	2024-10-18T00:24:08+00:00
URL	https://bradystandard.com/robots.txt
Domain IPs	104.154.203.214
Response IP	104.154.203.214
Found	Yes
Hash	e8f5b974cd3c554f1cd7bccb640c8dc39efd39552b91471e81d6f2b77d1c60e3
SimHash	a20d1bc5a455
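The 64-hex-character Hash value is consistent with a SHA-256 digest, although the report does not name the algorithm. A minimal sketch for comparing a fetched copy of the file against the reported value, assuming the scanner hashes the raw response bytes (the helper names `sha256_hex` and `matches_report` are hypothetical):

```python
import hashlib

# Hash value copied from the scan report above.
REPORTED_HASH = "e8f5b974cd3c554f1cd7bccb640c8dc39efd39552b91471e81d6f2b77d1c60e3"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the given bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes) -> bool:
    """Compare a fetched robots.txt body with the reported hash.

    Assumption: the scanner hashed the unmodified response body.
    """
    return sha256_hex(body) == REPORTED_HASH
```

A mismatch would not necessarily mean the file changed; the scanner may normalize line endings or encoding before hashing.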

Groups

*

Rule	Path
Disallow	/*?*page=*
Disallow	/editions/*
Disallow	/users/*
Disallow	/feed
Disallow	/feeds
Disallow	/rss
Disallow	/*?*q=*

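The Disallow paths in this group rely on `*` wildcards. A minimal sketch of how such patterns can be evaluated, assuming the common convention that `*` matches any run of characters, a trailing `$` anchors the end, and everything else is a literal prefix match (the function names are hypothetical, and the rule list assumes the wildcard forms `/*?*page=*` and `/*?*q=*`):

```python
import re

# Disallow paths for the "*" group, taken from the scan report.
DISALLOW = [
    "/*?*page=*",
    "/editions/*",
    "/users/*",
    "/feed",
    "/feeds",
    "/rss",
    "/*?*q=*",
]


def rule_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex anchored at the start."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with ".*" for each "*".
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_disallowed(path_and_query: str) -> bool:
    """Return True if any Disallow pattern matches the path plus query string."""
    return any(rule_to_regex(p).match(path_and_query) for p in DISALLOW)
```

Because the group contains only Disallow rules, no Allow/Disallow precedence logic is needed here. Prefix matching also means that `/feed` blocks paths such as `/feedback`.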
Other Records

Field	Value
crawl-delay	5

semrushbot

Rule	Path
Disallow	/

petalbot

Rule	Path
Disallow	/

Other Records

Field	Value
sitemap	https://s3.amazonaws.com/cjp-public-access/sitemaps/bsh/sitemap.xml.gz

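The records in this report can be read programmatically with Python's standard `urllib.robotparser`. The sketch below rebuilds a robots.txt from the tables in this report and queries it; the line order and letter case of the real file are assumptions, and `ExampleBot` is a hypothetical crawler name.

```python
from urllib.robotparser import RobotFileParser

SITEMAP_URL = "https://s3.amazonaws.com/cjp-public-access/sitemaps/bsh/sitemap.xml.gz"

# Reconstructed from the scan report; the original file's exact layout
# is an assumption.
ROBOTS_TXT = f"""\
User-agent: *
Disallow: /*?*page=*
Disallow: /editions/*
Disallow: /users/*
Disallow: /feed
Disallow: /feeds
Disallow: /rss
Disallow: /*?*q=*
Crawl-delay: 5

User-agent: semrushbot
Disallow: /

User-agent: petalbot
Disallow: /

Sitemap: {SITEMAP_URL}
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Crawlers without a dedicated group fall back to the "*" group.
delay = parser.crawl_delay("ExampleBot")

# semrushbot and petalbot are blocked from the whole site.
semrush_ok = parser.can_fetch("SemrushBot", "https://bradystandard.com/")
petal_ok = parser.can_fetch("PetalBot", "https://bradystandard.com/")

# Plain prefix rules such as /feed are honored by the standard parser.
feed_ok = parser.can_fetch("ExampleBot", "https://bradystandard.com/feed")

# site_maps() requires Python 3.8 or later.
sitemaps = parser.site_maps()
```

Note that `urllib.robotparser` compares paths as literal prefixes and does not expand `*` wildcards, so rules such as `/editions/*` need separate handling.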

Comments

See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file
