beauxarts.com
robots.txt

Robots Exclusion Standard data for beauxarts.com

Resource Scan

Scan Details

Site Domain beauxarts.com
Base Domain beauxarts.com
Scan Status Ok
Last Scan 2024-05-08T02:50:17+00:00
Next Scan 2024-06-07T02:50:17+00:00

Last Scan

Scanned 2024-05-08T02:50:17+00:00
URL https://beauxarts.com/robots.txt
Redirect https://www.beauxarts.com/robots.txt
Redirect Domain www.beauxarts.com
Redirect Base beauxarts.com
Domain IPs 104.26.2.62, 104.26.3.62, 172.67.70.204, 2606:4700:20::681a:23e, 2606:4700:20::681a:33e, 2606:4700:20::ac43:46cc
Redirect IPs 104.26.2.62, 104.26.3.62, 172.67.70.204, 2606:4700:20::681a:23e, 2606:4700:20::681a:33e, 2606:4700:20::ac43:46cc
Response IP 104.26.2.62
Found Yes
Hash ec60961bf535d3575f02dc07bff0242fcb5c41b86501e4e82a1197607785ce82
SimHash 43080910a573
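
The Hash value above is 64 hexadecimal digits long, which is the length of a SHA-256 digest. The report does not state which algorithm the scanner uses or exactly which bytes it hashes, so the following is only a sketch under the assumption that it is SHA-256 over the raw response body. The names `sha256_hex` and `matches_recorded` are hypothetical helpers, not part of any scanner.

```python
import hashlib

# Hash value copied from the scan report above.
RECORDED_HASH = "ec60961bf535d3575f02dc07bff0242fcb5c41b86501e4e82a1197607785ce82"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the given bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_recorded(body: bytes) -> bool:
    """Compare a fetched robots.txt body with the recorded hash.

    Assumption: the scanner hashed the unmodified response body with SHA-256.
    A mismatch may simply mean the file changed or the assumption is wrong.
    """
    return sha256_hex(body) == RECORDED_HASH
```

A comparison like this could be used to detect whether the file has changed since the last scan, provided the assumption about the hashing method holds.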

Groups

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

*
googlebot-news
*

Rule Path
Disallow /wp-login.php*
Disallow /wp-admin
Disallow /wp-includes
Disallow /21839324592/
Disallow /abonnement/
Disallow /attachment/
Disallow /mon-compte/
Disallow /creation-de-compte/
Disallow /activation/
Disallow /panier/
Disallow /selection/
Disallow /?attribute_pa*
Disallow /guide/guide-resultats*
Disallow /*.pdf
Disallow /?s=*
Disallow /exposition*
Disallow /grand-format/des-etoiles-sur-la-toile/
Disallow /selection/
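
The groups above can be checked programmatically. The sketch below rebuilds a partial robots.txt from the listing and queries it with Python's standard `urllib.robotparser`. The text in `ROBOTS_TXT` is a reconstruction, not the live file, and it includes only a few of the plain-prefix rules. The wildcard rules (for example `/*.pdf`) are left out because, as far as I am aware, `urllib.robotparser` does plain prefix matching and does not interpret `*` inside paths. The helper name `is_allowed` is hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Partial reconstruction of the groups listed above (assumption: the live
# file may differ in ordering, comments, and whitespace).
ROBOTS_TXT = """\
User-agent: ccbot
Disallow: /

User-agent: chatgpt-user
Disallow: /

User-agent: gptbot
Disallow: /

User-agent: google-extended
Disallow: /

User-agent: *
User-agent: googlebot-news
Disallow: /wp-admin
Disallow: /wp-includes
Disallow: /mon-compte/
Disallow: /panier/
Disallow: /selection/
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(user_agent: str, url: str) -> bool:
    """Return True if the reconstructed rules permit user_agent to fetch url."""
    return parser.can_fetch(user_agent, url)


# AI-related crawlers are blocked from the whole site.
print(is_allowed("GPTBot", "https://www.beauxarts.com/"))
# Other crawlers fall under the catch-all group.
print(is_allowed("Googlebot", "https://www.beauxarts.com/panier/"))
print(is_allowed("Googlebot", "https://www.beauxarts.com/"))
```

Under these reconstructed rules, the four named crawlers (ccbot, chatgpt-user, gptbot, google-extended) are denied every path, while all other agents are denied only the listed prefixes.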

Other Records

Field Value
sitemap https://www.beauxarts.com/sitemap-news.xml
sitemap https://www.beauxarts.com/sitemap.xml
sitemap https://www.beauxarts.com/sitemap-index.xml
sitemap https://www.beauxarts.com/sitemap-news.xml
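
The sitemap records above list `sitemap-news.xml` twice. A minimal sketch of reading such records with `urllib.robotparser` (its `site_maps()` method requires Python 3.8 or later) and dropping repeats while keeping first-seen order follows. The `SITEMAP_LINES` text is rebuilt from the listing, not fetched from the site.

```python
from urllib.robotparser import RobotFileParser

# Sitemap records rebuilt from the listing above (assumption: the live
# file spells them as standard "Sitemap:" lines).
SITEMAP_LINES = """\
Sitemap: https://www.beauxarts.com/sitemap-news.xml
Sitemap: https://www.beauxarts.com/sitemap.xml
Sitemap: https://www.beauxarts.com/sitemap-index.xml
Sitemap: https://www.beauxarts.com/sitemap-news.xml
"""

parser = RobotFileParser()
parser.parse(SITEMAP_LINES.splitlines())

# site_maps() returns None when no Sitemap lines were found, hence "or []".
all_sitemaps = parser.site_maps() or []

# dict.fromkeys keeps the first occurrence of each URL, in order.
unique_sitemaps = list(dict.fromkeys(all_sitemaps))

for url in unique_sitemaps:
    print(url)
```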