cagymind.me
robots.txt

Robots Exclusion Standard data for cagymind.me

Archived Snapshots

Resource Scan

Scan Details

Site Domain	cagymind.me
Base Domain	cagymind.me
Scan Status	Ok
Last Scan	2025-12-10T03:40:46+00:00
Next Scan	2025-12-17T03:40:46+00:00

Last Scan

Scanned	2025-12-10T03:40:46+00:00
URL	https://www.cagymind.me/robots.txt
Domain IPs	142.251.12.121, 2404:6800:4003:c02::79
Response IP	142.251.10.121
Found	Yes
Hash	1b825e8ebbe6ebff9c076d461f45707134a6853179c7d2922ba7f83b1d2c1d24
SimHash	3832e7b0d183

Groups

mediapartners-google

Rule	Path
Disallow

Rule

Path

Disallow

*

Product	Comment
*	to select all crawling bots and search engines

Rule	Path	Comment
Disallow	/search*	to block all user generated query item within the website.
Disallow	/20*	this line will disallow archieve section of Blogger.
Disallow	/feeds*	this line will disallow feeds. Read instruction below
Allow	/*.html	allow all post and pages of the blog

Rule

Path

Comment

Disallow

/search*

to block all user generated query item within the website.

Disallow

/20*

this line will disallow archieve section of Blogger.

Disallow

/feeds*

this line will disallow feeds. Read instruction below

Allow

/*.html

allow all post and pages of the blog

mediapartners-google

Rule	Path
Disallow

Rule

Path

Disallow

all browser and website inspector

Rule	Path
Allow	/

Rule

Path

Allow

*

Rule	Path
Disallow	/

Rule

Path

Disallow

*

Rule	Path
Disallow	/calendar/
Disallow	/junk/
Disallow	/books/fiction/contemporary/

Rule

Path

Disallow

/calendar/

Disallow

/junk/

Disallow

/books/fiction/contemporary/

googlebot-news

Rule	Path
Allow	/

Rule

Path

Allow

*

Rule	Path
Disallow	/

Rule

Path

Disallow

unnecessarybot

Rule	Path
Disallow	/

Rule

Path

Disallow

*

Rule	Path
Allow	/

Rule

Path

Allow

*

Rule	Path
Disallow	/useless_file.html
Disallow	/junk/other_useless_file.html

Rule

Path

Disallow

/useless_file.html

Disallow

/junk/other_useless_file.html

*

Rule	Path
Disallow	/
Allow	/public/

Rule

Path

Disallow

Allow

/public/

googlebot-image

Rule	Path
Disallow	/images/dogs.jpg

Rule

Path

Disallow

/images/dogs.jpg

googlebot-image

Rule	Path
Disallow	/

Rule

Path

Disallow

googlebot

Rule	Path
Disallow	/*.gif$

Rule

Path

Disallow

/*.gif$

*

Rule	Path
Disallow	/

Rule

Path

Disallow

mediapartners-google

Rule	Path
Allow	/

Rule

Path

Allow

googlebot

Rule	Path
Disallow	/*.xls$

Rule

Path

Disallow

/*.xls$

Other Records

Field	Value
sitemap	https://www.cagymind.me/sitemap.xml
sitemap	https://www.cagymind.me/sitemap.xml

Field

Value

sitemap

https://www.cagymind.me/sitemap.xml

sitemap

https://www.cagymind.me/sitemap.xml

Comments

sitemap of the blog

Warnings

1 invalid line.

cagymind.merobots.txt

Resource Scan

Scan Details

Last Scan

Groups

mediapartners-google

*

mediapartners-google

all browser and website inspector

*

*

googlebot-news

*

unnecessarybot

*

*

*

googlebot-image

googlebot-image

googlebot

*

mediapartners-google

googlebot

Other Records

Comments

Warnings

cagymind.me
robots.txt