greap.blog
robots.txt

Robots Exclusion Standard data for greap.blog

Resource Scan

Scan Details

Site Domain	greap.blog
Base Domain	greap.blog
Scan Status	Ok
Last Scan	2025-12-18T00:50:45+00:00
Next Scan	2025-12-25T00:50:45+00:00

Last Scan

Scanned	2025-12-18T00:50:45+00:00
URL	https://greap.blog/robots.txt
Domain IPs	162.244.95.12, 2602:faa9:4002:210:30e0:944e:2900:1ec1
Response IP	162.244.95.12
Found	Yes
Hash	98fc84710ed955aedc2f7e6c716942b4ba1705142366eed08b61702d7eae0da6
SimHash	5b544b21e4e6

Groups

*

Rule	Path
Disallow	/wp-admin/
Allow	/wp-admin/admin-ajax.php
Allow	/feed/
Allow	*/feed/
Allow	*.jpg
Allow	*.jpeg
Allow	*.png
Allow	*.gif
Allow	*.svg
Allow	*.webp
Allow	*.pdf
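
The rules in this group overlap: an image under /wp-admin/ matches both the Disallow and an extension Allow. The following is a minimal sketch of how such overlaps are commonly resolved, assuming the longest-match precedence described in RFC 9309 (the longest matching pattern wins, and Allow wins a tie). The names `RULES`, `pattern_to_regex`, and `is_allowed` are illustrative, not part of the scanned site or of any library.

```python
import re

# Rules of the "*" group, copied from the table above.
RULES = [
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("allow", "/feed/"),
    ("allow", "*/feed/"),
    ("allow", "*.jpg"),
    ("allow", "*.jpeg"),
    ("allow", "*.png"),
    ("allow", "*.gif"),
    ("allow", "*.svg"),
    ("allow", "*.webp"),
    ("allow", "*.pdf"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex anchored at the start.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    Assumption: longest matching pattern wins; on equal length, Allow wins.
    A path that matches no rule is allowed.
    """
    best = None  # (pattern length, is_allow)
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]
```

Under this assumption, `is_allowed("/wp-admin/admin-ajax.php")` is true because the Allow pattern is longer than `/wp-admin/`, while a hypothetical path such as `/wp-admin/images/logo.png` stays blocked because `/wp-admin/` (10 characters) is longer than `*.png` (5 characters). Crawlers that do not follow longest-match precedence may behave differently.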

googlebot
googlebot-image

Rule	Path
Allow	/

adsbot-google

Rule	Path
Allow	/

mediapartners-google

Rule	Path
Allow	/
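
Because the Google crawlers have their own groups, the "*" group should not apply to them. The following is a minimal sketch of group selection, assuming the RFC 9309 behaviour in which a crawler uses the group whose user-agent token matches its own (case-insensitively) and falls back to "*" only when no named group matches. `GROUPS` and `select_group` are illustrative names, and `bingbot` in the usage note is only an example of a crawler without a named group.

```python
# Groups as listed in the scan; each maps a user-agent token to its rules.
STAR_RULES = [
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("allow", "/feed/"),
    ("allow", "*/feed/"),
    ("allow", "*.jpg"),
    ("allow", "*.jpeg"),
    ("allow", "*.png"),
    ("allow", "*.gif"),
    ("allow", "*.svg"),
    ("allow", "*.webp"),
    ("allow", "*.pdf"),
]

GROUPS = {
    "*": STAR_RULES,
    "googlebot": [("allow", "/")],
    "googlebot-image": [("allow", "/")],
    "adsbot-google": [("allow", "/")],
    "mediapartners-google": [("allow", "/")],
}


def select_group(user_agent_token, groups=GROUPS):
    """Return the rules for a crawler's product token.

    Assumption: an exact, case-insensitive token match selects a named
    group; otherwise the "*" group applies.
    """
    return groups.get(user_agent_token.lower(), groups["*"])
```

With this selection, `select_group("Googlebot")` returns only `Allow /`, so the `/wp-admin/` Disallow from the "*" group would not apply to that crawler, whereas `select_group("bingbot")` falls back to the "*" rules.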

Other Records

Field	Value
sitemap	https://greap.blog/sitemap_index.xml
sitemap	https://greap.blog/post-sitemap1.xml
sitemap	https://greap.blog/post-sitemap2.xml
sitemap	https://greap.blog/post-sitemap3.xml
sitemap	https://greap.blog/post-sitemap4.xml
sitemap	https://greap.blog/page-sitemap.xml
sitemap	https://greap.blog/category-sitemap.xml

Comments

Explicitly allow RSS feeds for Google Discover Follow feature
Allow all media files for Google Discover visual content
Sitemaps - Main Index + All Individual Sitemaps for Maximum Discovery
Specific bots configuration
