celineguichard.name
robots.txt
Robots Exclusion Standard data for celineguichard.name
Resource Scan
Scan Details
| Site Domain | celineguichard.name |
| Base Domain | celineguichard.name |
| Scan Status | Ok |
| Last Scan | 2025-12-21T23:48:54+00:00 |
| Next Scan | 2025-12-22T23:48:54+00:00 |
Last Scan
| Scanned | 2025-12-21T23:48:54+00:00 |
| URL | https://celineguichard.name/robots.txt |
| Domain IPs | 2a02:4780:84:5998:52f6:9e12:f58c:88f3, 2a02:4780:84:d483:7e93:6d5f:cc3a:b3ee, 77.37.66.33, 93.127.196.232 |
| Response IP | 93.127.196.139 |
| Found | Yes |
| Hash | 1af4dd9e6c42608316a39b568aafe7ce01b020cedbbe661781c1326be3725595 |
| SimHash | 6d9889834282 |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /wp-content/uploads/wc-logs/ |
| Disallow | /wp-content/uploads/woocommerce_transient_files/ |
| Disallow | /wp-content/uploads/woocommerce_uploads/ |
| Disallow | /*?add-to-cart= |
| Disallow | /*?*add-to-cart= |
| Disallow | /wp-admin/ |
| Allow | /wp-admin/admin-ajax.php |
addsearchbot
ai2bot
ai2bot-dolma
aihitbot
amazonbot
applebot-extended
anthropic-ai
bedrockbot
bigsur.ai
brightbot 1.0
bytespider
ccbot
chatgpt-user
claudebot
claude-user
claude-searchbot
cohere-ai
cohere-training-data-crawler
cotoyogi
crawlspace
deepseekbot
diffbot
echoboxbot
facebookbot
factset_spyderbot
firecrawlagent
friendlycrawler
gptbot
googleagent-mariner
gemini-deep-research
google-extended
imagesiftbot
kangaroo bot
meta-externalagent
meta-externalfetcher
oai-searchbot
omgili
omgilibot
pangubot
panscient
panscient.com
petalbot
perplexitybot
perplexity‑user
poseidon research crawler
sbintuitionsbot
scrapy
semrushbot
semrushbot-ocob
semrushbot-ft
sentibot
sentibot
terracotta
thinkbot
timpibot
turnitinbot
yak
yandexadditional
yandexadditionalbot
youbot
webzio
webzio-extended
| Rule | Path |
|---|---|
| Disallow | / |
Other Records
| Field | Value |
|---|---|
| sitemap | https://celineguichard.name/sitemap.xml |
| sitemap | https://celineguichard.name/news-sitemap.xml |
Comments