agrowia.com
robots.txt

Robots Exclusion Standard data for agrowia.com

Resource Scan

Scan Details

Site Domain	agrowia.com
Base Domain	agrowia.com
Scan Status	Ok
Last Scan	2024-09-29T05:23:34+00:00
Next Scan	2024-10-06T05:23:34+00:00

Last Scan

Scanned	2024-09-29T05:23:34+00:00
URL	https://www.agrowia.com/robots.txt
Domain IPs	142.251.12.121, 2404:6800:4003:c1c::79
Response IP	74.125.130.121
Found	Yes
Hash	09e5b9037dca1b8064f3f01473db645c76d80ccd76f3c665dead6c32891cb46e
SimHash	3030d500c5b3
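
The Hash value above is 64 hexadecimal characters long, which matches the length of a SHA-256 digest. The report does not name the algorithm, so treating it as SHA-256 of the response body is an assumption. Under that assumption, a fetched copy of the file could be fingerprinted roughly as follows; `sha256_hex` and the sample bytes are hypothetical and are not the real file contents.

```python
import hashlib


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of `body` as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


# Hypothetical stand-in content; the real robots.txt body is not reproduced here.
sample = b"User-agent: *\nDisallow: /search*\n"
digest = sha256_hex(sample)
print(len(digest))  # 64 hex characters, the same length as the Hash field
```

Comparing digests between scans is a cheap way to detect that the file changed, whichever hash function the scanner actually uses.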

Groups

mediapartners-google

Rule	Path
Disallow

*

Product	Comment
*	to select all crawling bots and search engines

Rule	Path	Comment
Disallow	/search*	to block all user-generated query items within the website.
Disallow	/20*	this line will disallow the archive section of Blogger.
Disallow	/feeds*	this line will disallow feeds. Read instruction below
Allow	/*.html	allow all posts and pages of the blog
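
How these four rules interact depends on the crawler. The sketch below assumes the longest-match precedence described in RFC 9309, with `*` as a wildcard and ties resolved in favor of Allow; real crawlers may behave differently. The rule list is copied from the table above, while `pattern_to_regex`, `is_allowed`, and the sample paths are hypothetical names chosen for illustration.

```python
import re

# Rules for the "*" group, copied from the table above.
RULES = [
    ("disallow", "/search*"),
    ("disallow", "/20*"),
    ("disallow", "/feeds*"),
    ("allow", "/*.html"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence."""
    best_len, allowed = -1, True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            # The longer pattern wins; on a tie, prefer the allow rule.
            if length > best_len or (length == best_len and kind == "allow"):
                best_len, allowed = length, (kind == "allow")
    return allowed


# Hypothetical sample paths, not taken from the scanned site.
for sample in ["/search?q=rice", "/2024/09/", "/2024/09/post.html", "/feeds/posts/default"]:
    print(sample, is_allowed(sample))
```

Under this model, a dated post such as `/2024/09/post.html` stays crawlable because `/*.html` (7 characters) is longer than `/20*` (4 characters), while the bare archive listing `/2024/09/` is blocked.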

Other Records

Field	Value
sitemap	https://www.agrowia.com/sitemap.xml
sitemap	https://www.agrowia.com/sitemap-pages.xml

Comments

sitemap of the blog
