learngreen.net
robots.txt

Robots Exclusion Standard data for learngreen.net

Resource Scan

Scan Details

Site Domain learngreen.net
Base Domain learngreen.net
Scan Status Ok
Last Scan2025-11-02T09:19:19+00:00
Next Scan 2025-11-09T09:19:19+00:00

Last Scan

Scanned2025-11-02T09:19:19+00:00
URL https://learngreen.net/robots.txt
Redirect https://www.learngreen.net/robots.txt
Redirect Domain www.learngreen.net
Redirect Base learngreen.net
Domain IPs 2001:4860:4802:32::15, 2001:4860:4802:34::15, 2001:4860:4802:36::15, 2001:4860:4802:38::15, 216.239.32.21, 216.239.34.21, 216.239.36.21, 216.239.38.21
Redirect IPs 2404:6800:4003:c02::79, 64.233.170.121
Response IP 142.251.12.121
Found Yes
Hash a36c618c1dfe72bb9f7f0f9b3268cee41bc5e88b8999d55aea6d9ebf449ead8b
SimHash 4c018dd5cdb1

Groups

googlebot

Rule Path
Disallow /nogooglebot/
Allow /ads.txt

*

Rule Path
Allow /
Disallow /wp-admin/
Disallow /search
Disallow /category/
Disallow /tag/
Disallow /favicon
Disallow /search

Other Records

Field Value
sitemap https://www.codecraftingjava.blogspot.com/sitemap.xml
sitemap https://codecraftingjava.blogspot.com/atom.xml?redirect=false&start-index=1&max-results=500