saggiamente.com
robots.txt

Robots Exclusion Standard data for saggiamente.com

Resource Scan

Scan Details

Site Domain saggiamente.com
Base Domain saggiamente.com
Scan Status Ok
Last Scan4/17/2025, 6:28:56 PM
Next Scan 4/24/2025, 6:28:56 PM

Last Scan

Scanned4/17/2025, 6:28:56 PM
URL https://saggiamente.com/robots.txt
Domain IPs 104.21.63.116, 172.67.145.115, 2606:4700:3030::6815:3f74, 2606:4700:3034::ac43:9173
Response IP 104.21.63.116
Found Yes
Hash 7f062b95fae8dc93f0880e59783ba87b397f1babad73ed65ff2066a47abab854
SimHash 7c24795382a6

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /donations/
Disallow /saggiascelta/
Disallow /mailcheck/
Disallow /push/
Disallow /resources/
Disallow /track/
Disallow /tools/
Disallow /veritas/
Disallow /jobs/
Disallow /hot-deals/
Disallow /get/
Disallow /go/
Disallow /short/

adsbot-google
amazonbot
anthropic-ai
applebot-extended
awariorssbot
awariosmartbot
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
dataforseobot
diffbot
facebookbot
friendlycrawler
google-extended
googleother
gptbot
img2dataset
imagesiftbot
magpie-crawler
meltwater
omgili
omgilibot
peer39_crawler
peer39_crawler/1.0
perplexitybot
piplbot
scoop.it
seekr
youbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.saggiamente.com/sitemap_index.xml