altalex.com
robots.txt
Robots Exclusion Standard data for altalex.com
Resource Scan
Scan Details
Site Domain | altalex.com |
Base Domain | altalex.com |
Scan Status | Ok |
Last Scan | 2024-11-14T15:22:52+00:00 |
Next Scan | 2024-11-21T15:22:52+00:00 |
Last Scan
Scanned | 2024-11-14T15:22:52+00:00 |
URL | https://altalex.com/robots.txt |
Redirect | https://www.altalex.com/robots.txt |
Redirect Domain | www.altalex.com |
Redirect Base | altalex.com |
Domain IPs | 104.18.28.79, 104.18.29.79, 2606:4700::6812:1c4f, 2606:4700::6812:1d4f |
Redirect IPs | 104.18.28.79, 104.18.29.79, 2606:4700::6812:1c4f, 2606:4700::6812:1d4f |
Response IP | 104.18.29.79 |
Found | Yes |
Hash | 97266eb6c320a9598d56dc19c66511be7a71098f5e8aa57c9080b07bbeeb981c |
SimHash | 740e0911c766 |
Groups
*
Rule | Path |
---|---|
Disallow | /*/ultimo$ |
Disallow | /*/ultimo/ |
Disallow | */ultimo$ |
Disallow | */ultimo/* |
Disallow | /api/ |
Disallow | /boxes/ |
Disallow | /strumentidocumentoword/ |
ai2bot
ai2bot-dolma
amazonbot
anthropic-ai
applebot
applebot-extended
bytespider
ccbot
chatgpt-user
claude-web
claudebot
cohere-ai
diffbot
duckassistbot
friendlycrawler
google-extended
googleother
googleother-image
googleother-video
gptbot
iaskspider/2.0
icc-crawler
imagesiftbot
img2dataset
isscyberriskcrawler
kangaroo bot
meta-externalagent
meta-externalfetcher
oai-searchbot
omgili
omgilibot
perplexitybot
petalbot
scrapy
sidetrade indexer bot
timpibot
velenpublicwebcrawler
webzio-extended
youbot
Rule | Path |
---|---|
Disallow | / |
Other Records
Field | Value |
---|---|
sitemap | https://www.altalex.com/sitemap_indice_news.xml |
sitemap | https://www.altalex.com/sitemap_indice_banche_dati.xml |
sitemap | https://www.altalex.com/sitemap_indice_sezioni.xml |
Comments