horizon2020projects.com
robots.txt

Robots Exclusion Standard data for horizon2020projects.com

Resource Scan

Scan Details

Site Domain horizon2020projects.com
Base Domain horizon2020projects.com
Scan Status Ok
Last Scan 2025-10-10T15:33:55+00:00
Next Scan 2025-11-09T15:33:55+00:00

Last Scan

Scanned 2025-10-10T15:33:55+00:00
URL http://horizon2020projects.com/robots.txt
Redirect https://horizon2020projects.com/robots.txt
Domain IPs 104.21.50.8, 172.67.198.180, 2606:4700:3031::ac43:c6b4, 2606:4700:3035::6815:3208
Response IP 104.21.50.8
Found Yes
Hash cd780bbfaa2d85c57352411ac925ba290c56544f5d3d55add5fafffc18f2211a
SimHash 49645c8057d2

Groups

*

Rule Path
Disallow /comments/feed
Disallow /feed/$
Disallow /*/feed/$
Disallow /*/feed/rss/$
Disallow /*/trackback/$
Disallow /*/*/feed/$
Disallow /*/*/feed/rss/$
Disallow /*/*/trackback/$
Disallow /*/*/*/feed/$
Disallow /*/*/*/feed/rss/$
Disallow /*/*/*/trackback/$
Disallow /trackback/
Disallow /wp-admin/
Disallow /*.inc$
Disallow */trackback/
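
The paths in this group use the two common robots.txt pattern characters: `*` (any run of characters) and a trailing `$` (end of the URL path). As a minimal sketch of how such patterns are usually evaluated, assuming the matching semantics described in RFC 9309, the helper names `pattern_to_regex` and `is_disallowed` below are hypothetical and not part of the scan:

```python
import re

# Disallow paths copied from the "*" group listed above.
DISALLOW = [
    "/comments/feed", "/feed/$", "/*/feed/$", "/*/feed/rss/$",
    "/*/trackback/$", "/*/*/feed/$", "/*/*/feed/rss/$",
    "/*/*/trackback/$", "/*/*/*/feed/$", "/*/*/*/feed/rss/$",
    "/*/*/*/trackback/$", "/trackback/", "/wp-admin/", "/*.inc$",
    "*/trackback/",
]

def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # "*" matches any sequence of characters; everything else is literal.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    # Without "$" the pattern is a prefix match; with it, the path must end there.
    return re.compile(body + (r"\Z" if anchored else ""))

def is_disallowed(path: str, patterns=DISALLOW) -> bool:
    """True if any Disallow pattern matches from the start of the path."""
    return any(pattern_to_regex(p).match(path) for p in patterns)

for path in ["/feed/", "/feed/atom/", "/news/feed/", "/includes/header.inc"]:
    print(path, is_disallowed(path))
```

Because this group contains only Disallow rules, any single match is enough to block a path. A file that mixes Allow and Disallow rules would also need the longest-match precedence step, which this sketch leaves out.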

mediapartners-google

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

ia_archiver

Rule Path
Disallow /
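
A crawler obeys only one group: the one naming its own product token, or `*` when none does. Rules from `*` are not merged into a named group, so the four Google crawlers listed here are not bound by the `*` Disallow rules, while ia_archiver is blocked from the whole site. A rough sketch of that selection, assuming exact case-insensitive token matching (real crawlers may match more loosely) and using a hypothetical `select_group` helper:

```python
# Groups as listed in the scan; the "*" rule list is abbreviated to one entry.
GROUPS = {
    "*": [("disallow", "/wp-admin/")],  # abbreviated, see the full list in the scan
    "mediapartners-google": [("allow", "/")],
    "adsbot-google": [("allow", "/")],
    "googlebot-image": [("allow", "/")],
    "googlebot-mobile": [("allow", "/")],
    "ia_archiver": [("disallow", "/")],
}

def select_group(product_token: str) -> str:
    """Return the name of the group a crawler with this token would obey."""
    token = product_token.lower()
    # A named group wins outright; otherwise fall back to the wildcard group.
    return token if token in GROUPS else "*"

for agent in ["ia_archiver", "Googlebot-Image", "bingbot"]:
    group = select_group(agent)
    print(agent, "->", group, GROUPS[group])
```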

Other Records

Field Value
sitemap https://horizon2020projects.com/sitemap.xml

Warnings

  • `` is not a known field.