opendemocracy.net
robots.txt

Robots Exclusion Standard data for opendemocracy.net

Resource Scan

Scan Details

Site Domain opendemocracy.net
Base Domain opendemocracy.net
Scan Status Ok
Last Scan 2024-05-23T04:43:30+00:00
Next Scan 2024-06-22T04:43:30+00:00

Last Scan

Scanned 2024-05-23T04:43:30+00:00
URL https://opendemocracy.net/robots.txt
Redirect https://www.opendemocracy.net/robots.txt
Redirect Domain www.opendemocracy.net
Redirect Base opendemocracy.net
Domain IPs 104.22.6.145, 104.22.7.145, 172.67.24.6, 2606:4700:10::6816:691, 2606:4700:10::6816:791, 2606:4700:10::ac43:1806
Redirect IPs 104.22.6.145, 104.22.7.145, 172.67.24.6, 2606:4700:10::6816:691, 2606:4700:10::6816:791, 2606:4700:10::ac43:1806
Response IP 172.67.24.6
Found Yes
Hash 4304ac0d77ee7deb047d307506b58064dc106b3429c6d9bc223268e1f7affe52
SimHash 79101952e2a0

Groups

*

Rule Path
Disallow /admin/
Disallow /en/openmovements/
Disallow /es/covid-19-demoabierta/
Disallow /en/5050/tracking-the-backlash/

linespider

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

*

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

adsbot-google
amazonbot
anthropic-ai
applebot
awariorssbot
awariosmartbot
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
dataforseobot
facebookbot
google-extended
googleother
gptbot
imagesiftbot
magpie-crawler
meltwater
omgili
omgilibot
peer39_crawler
peer39_crawler/1.0
perplexitybot
seekr
youbot

Rule Path
Disallow /
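
To see how a crawler might read the groups listed above, here is a minimal sketch using Python's standard urllib.robotparser. The robots.txt text is a partial reconstruction from this report, not the file as served: the directive layout is assumed, only two of the blocked agents from the long list (gptbot, ccbot) are included, and the second * group and the crawl-delay record are left out. The agent name ExampleBot and the helper allowed() are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Partial reconstruction of the groups in this report (layout is an assumption).
ROBOTS_TXT = """\
User-agent: *
Disallow: /admin/
Disallow: /en/openmovements/
Disallow: /es/covid-19-demoabierta/
Disallow: /en/5050/tracking-the-backlash/

User-agent: linespider
Disallow: /

User-agent: semrushbot
Disallow: /

User-agent: gptbot
User-agent: ccbot
Disallow: /
"""

# Parse the text directly instead of fetching it over the network.
parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Hypothetical helper: ask the parser whether `agent` may fetch `path`."""
    return parser.can_fetch(agent, "https://www.opendemocracy.net" + path)


if __name__ == "__main__":
    for agent, path in [
        ("ExampleBot", "/en/"),          # falls under the * group
        ("ExampleBot", "/admin/login"),  # blocked by Disallow /admin/
        ("gptbot", "/en/"),              # blocked site-wide
        ("linespider", "/"),             # blocked site-wide
    ]:
        print(agent, path, allowed(agent, path))
```

Real crawlers may resolve repeated * groups and rule precedence differently from this parser, so the result is indicative only.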

Other Records

Field Value
sitemap https://www.opendemocracy.net/sitemap.xml
sitemap https://www.opendemocracy.net/news-sitemap.xml
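
The sitemap records above can be read with the same standard-library parser. The sketch below assumes Python 3.8 or later (for site_maps()) and assumes the records appear in the file as ordinary Sitemap: lines. The function name list_sitemaps is hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Only the sitemap records from this report; the line layout is an assumption.
ROBOTS_TXT = """\
Sitemap: https://www.opendemocracy.net/sitemap.xml
Sitemap: https://www.opendemocracy.net/news-sitemap.xml
"""


def list_sitemaps(text: str) -> list[str]:
    """Hypothetical helper: return the sitemap URLs declared in robots.txt text."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    # site_maps() returns None when no Sitemap lines are present.
    return parser.site_maps() or []


if __name__ == "__main__":
    for url in list_sitemaps(ROBOTS_TXT):
        print(url)
```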