awartisan.pt
robots.txt

Robots Exclusion Standard data for awartisan.pt

Resource Scan

Scan Details

Site Domain awartisan.pt
Base Domain awartisan.pt
Scan Status Ok
Last Scan2024-09-21T04:23:30+00:00
Next Scan 2024-10-21T04:23:30+00:00

Last Scan

Scanned2024-09-21T04:23:30+00:00
URL https://awartisan.pt/robots.txt
Redirect https://www.awartisan.pt/robots.txt
Redirect Domain www.awartisan.pt
Redirect Base awartisan.pt
Domain IPs 104.21.70.4, 172.67.217.42, 2606:4700:3031::ac43:d92a, 2606:4700:3035::6815:4604
Redirect IPs 104.21.70.4, 172.67.217.42, 2606:4700:3031::ac43:d92a, 2606:4700:3035::6815:4604
Response IP 104.21.70.4
Found Yes
Hash 91e9d3e56e14e91119d00e11a2b4432c82d0b99e988b7ed41f7bc49fddb6f089
SimHash 081d5f08e0d0

Groups

*

Rule Path
Disallow /*.pdf$
Disallow /return_policy
Disallow /privacy_policy
Disallow /cookies
Disallow /attachment.php*
Disallow /asset_label*
Disallow /page.php*
Disallow /*.sys$
Disallow /ethics
Disallow /image_root*

Other Records

Field Value
sitemap https://www.awartisan.pt/sitemaps/es_pt.xml.gz