saggiamente.com
robots.txt
Robots Exclusion Standard data for saggiamente.com
Resource Scan
Scan Details
Site Domain | saggiamente.com |
Base Domain | saggiamente.com |
Scan Status | Ok |
Last Scan | 4/17/2025, 6:28:56 PM |
Next Scan | 4/24/2025, 6:28:56 PM |
Last Scan
Scanned | 4/17/2025, 6:28:56 PM |
URL | https://saggiamente.com/robots.txt |
Domain IPs | 104.21.63.116, 172.67.145.115, 2606:4700:3030::6815:3f74, 2606:4700:3034::ac43:9173 |
Response IP | 104.21.63.116 |
Found | Yes |
Hash | 7f062b95fae8dc93f0880e59783ba87b397f1babad73ed65ff2066a47abab854 |
SimHash | 7c24795382a6 |
Groups
*
Rule | Path |
---|---|
Disallow | /wp-admin/ |
Disallow | /donations/ |
Disallow | /saggiascelta/ |
Disallow | /mailcheck/ |
Disallow | /push/ |
Disallow | /resources/ |
Disallow | /track/ |
Disallow | /tools/ |
Disallow | /veritas/ |
Disallow | /jobs/ |
Disallow | /hot-deals/ |
Disallow | /get/ |
Disallow | /go/ |
Disallow | /short/ |
adsbot-google
amazonbot
anthropic-ai
applebot-extended
awariorssbot
awariosmartbot
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
dataforseobot
diffbot
facebookbot
friendlycrawler
google-extended
googleother
gptbot
img2dataset
imagesiftbot
magpie-crawler
meltwater
omgili
omgilibot
peer39_crawler
peer39_crawler/1.0
perplexitybot
piplbot
scoop.it
seekr
youbot
Rule | Path |
---|---|
Disallow | / |
Other Records
Field | Value |
---|---|
sitemap | https://www.saggiamente.com/sitemap_index.xml |