georss.org
robots.txt

Robots Exclusion Standard data for georss.org

Resource Scan

Scan Details

Site Domain georss.org
Base Domain georss.org
Scan Status Ok
Last Scan 2025-08-20T17:46:45+00:00
Next Scan 2025-09-19T17:46:45+00:00

Last Scan

Scanned 2025-08-20T17:46:45+00:00
URL https://georss.org/robots.txt
Domain IPs 104.21.1.235, 172.67.152.147, 2606:4700:3030::ac43:9893, 2606:4700:3035::6815:1eb
Response IP 104.21.1.235
Found Yes
Hash 33191670da87446ebb6dacb58e8370ee4d3f84bb95e4fd321468f479f402383b
SimHash 48445e82d792
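
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest of the fetched file, although the report does not name the algorithm. As a minimal sketch under that assumption, the helper below (the name `content_fingerprint` is hypothetical) shows how such a digest could be computed from the response body to detect changes between scans. It does not attempt to reproduce the SimHash value, whose construction is not described here.

```python
import hashlib


def content_fingerprint(body: bytes) -> str:
    """Return the hex SHA-256 digest of a robots.txt response body.

    Assumption: the scanner hashes the raw bytes with SHA-256. The report
    only shows a 64-character hex string and does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


if __name__ == "__main__":
    # Illustrative body only, not the actual georss.org file.
    sample = b"User-agent: *\nDisallow: /wp-admin/\n"
    print(content_fingerprint(sample))
```

Two scans whose digests match returned byte-identical files. Any difference, including whitespace, produces a different digest.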

Groups

*

Rule Path
Disallow /comments/feed
Disallow /feed/$
Disallow /*/feed/$
Disallow /*/feed/rss/$
Disallow /*/trackback/$
Disallow /*/*/feed/$
Disallow /*/*/feed/rss/$
Disallow /*/*/trackback/$
Disallow /*/*/*/feed/$
Disallow /*/*/*/feed/rss/$
Disallow /*/*/*/trackback/$
Disallow /trackback/
Disallow /wp-admin/
Disallow /*.inc$
Disallow */trackback/
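
The paths in this group use two special characters: `*` and a trailing `$`. The sketch below assumes the common interpretation (as in RFC 9309 and Google's crawler documentation) in which `*` matches any run of characters, a trailing `$` anchors the end of the URL path, and every pattern is otherwise a prefix match. The function names are hypothetical, and real crawlers may differ in details such as percent-encoding.

```python
import re

# Disallow paths copied from the "*" group above.
DISALLOW = [
    "/comments/feed", "/feed/$", "/*/feed/$", "/*/feed/rss/$",
    "/*/trackback/$", "/*/*/feed/$", "/*/*/feed/rss/$",
    "/*/*/trackback/$", "/*/*/*/feed/$", "/*/*/*/feed/rss/$",
    "/*/*/*/trackback/$", "/trackback/", "/wp-admin/", "/*.inc$",
    "*/trackback/",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    # "*" becomes ".*"; every other character is matched literally.
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))


def is_disallowed(path: str, patterns=DISALLOW) -> bool:
    """Return True if any pattern matches from the start of the path."""
    return any(pattern_to_regex(p).match(path) for p in patterns)


if __name__ == "__main__":
    for path in ["/feed/", "/feed/atom/", "/2020/05/post/trackback/", "/about/"]:
        print(path, is_disallowed(path))
```

Under this interpretation `*` can also match `/`, so `/*/feed/$` already covers the deeper `/*/*/feed/$` and `/*/*/*/feed/$` variants. The extra rules are harmless and may exist for parsers that treat `*` more narrowly.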

mediapartners-google

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

ia_archiver

Rule Path
Disallow /
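
Each group above applies only to the crawler it names, and `*` acts as the fallback for everyone else. The sketch below illustrates that selection step, assuming a case-insensitive exact match on the crawler's product token. The function name `select_group` is hypothetical, and real crawlers may apply looser matching.

```python
# Group names as listed in this report.
GROUP_NAMES = {
    "*",
    "mediapartners-google",
    "adsbot-google",
    "googlebot-image",
    "googlebot-mobile",
    "ia_archiver",
}


def select_group(product_token: str, group_names=GROUP_NAMES) -> str:
    """Pick the group whose name equals the crawler's product token.

    Matching is case-insensitive; crawlers without a dedicated group
    fall back to the "*" group.
    """
    token = product_token.strip().lower()
    return token if token in group_names else "*"


if __name__ == "__main__":
    for agent in ["Googlebot-Image", "ia_archiver", "bingbot"]:
        print(agent, "->", select_group(agent))
```

With these groups, the four Google crawlers listed are allowed everywhere, `ia_archiver` is blocked from the whole site, and any other crawler is governed by the `*` rules.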

Other Records

Field Value
sitemap https://georss.org/sitemap_index.xml