philippebedard.net
robots.txt

Robots Exclusion Standard data for philippebedard.net

Resource Scan

Scan Details

Site Domain philippebedard.net
Base Domain philippebedard.net
Scan Status Ok
Last Scan 2025-09-04T14:44:46+00:00
Next Scan 2025-09-11T14:44:46+00:00

Last Scan

Scanned 2025-09-04T14:44:46+00:00
URL https://philippebedard.net/robots.txt
Redirect https://www.philippebedard.net/robots.txt
Redirect Domain www.philippebedard.net
Redirect Base philippebedard.net
Domain IPs 4.153.215.143
Redirect IPs 4.192.73.169
Response IP 4.192.73.169
Found Yes
Hash 1feb36ff502b215a7e968a621471ccc06731df3f92a9fa6e895b4649cbc07ddb
SimHash 50095971e2a0
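
The Hash value above is 64 hexadecimal characters, which is the length of a SHA-256 digest. The report does not state which algorithm the scanner uses, so the following is only a sketch of how a content hash can detect changes between scans, assuming SHA-256 over the raw response body. The function names and the sample body are hypothetical.

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_digest: str, body: bytes) -> bool:
    """Report whether the body differs from the one hashed at the last scan."""
    return body_digest(body) != previous_digest


# Hypothetical body, not the actual file served by the site.
sample_body = b"User-agent: *\nDisallow: /\n"
stored_digest = body_digest(sample_body)
```

An exact hash changes on any byte-level edit, which is presumably why the report also carries a SimHash for approximate comparison.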

Groups

*

Rule Path
Disallow *?replytocom
Disallow *?fb_comment_id
Disallow *?amp
Disallow *?amp-wp-skip-redirect
Disallow *//1000
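
The paths in this group rely on the `*` wildcard. As a rough illustration of how such patterns can be evaluated, the sketch below translates each pattern into a regular expression, assuming the common convention that `*` matches any run of characters, a trailing `$` anchors the end, and everything else is a literal prefix match. The helper names are hypothetical, and real crawlers may differ in details such as percent-encoding.

```python
import re

# Disallow patterns listed for the "*" group in this report.
DISALLOW_PATTERNS = [
    "*?replytocom",
    "*?fb_comment_id",
    "*?amp",
    "*?amp-wp-skip-redirect",
    "*//1000",
]


def robots_pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with ".*" for each "*".
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_disallowed(path_and_query: str) -> bool:
    """Return True if any Disallow pattern matches the path plus query."""
    return any(
        robots_pattern_to_regex(p).match(path_and_query)
        for p in DISALLOW_PATTERNS
    )
```

Under these assumptions, `is_disallowed("/blog/post/?replytocom=42")` is true, while `/blog/post/?page=2&replytocom=42` is not matched because the patterns require the parameter to follow the `?` directly.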

adsbot-google
amazonbot
anthropic-ai
applebot
applebot-extended
awariorssbot
awariosmartbot
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
dataforseobot
diffbot
facebookbot
friendlycrawler
google-extended
googleother
gptbot
img2dataset
imagesiftbot
magpie-crawler
meltwater
omgili
omgilibot
peer39_crawler
peer39_crawler/1.0
perplexitybot
piplbot
scoop.it
seekr
youbot

Rule Path
Disallow /
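
To illustrate the effect of this group, the sketch below feeds a small subset of the listed user agents into Python's standard `urllib.robotparser` and checks whether a URL may be fetched. The robots.txt text is reconstructed from the report rather than copied from the site, and `ExampleBrowserBot` is a made-up agent name. The `*` group is left out because `urllib.robotparser` does plain prefix matching and does not interpret wildcard paths.

```python
from urllib.robotparser import RobotFileParser

# Subset of the agents listed above, sharing one "Disallow: /" rule.
ROBOTS_SUBSET = """\
User-agent: gptbot
User-agent: claudebot
User-agent: ccbot
User-agent: perplexitybot
Disallow: /
"""


def build_parser(robots_text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(robots_text.splitlines())
    return parser


parser = build_parser(ROBOTS_SUBSET)
url = "https://www.philippebedard.net/sitemap.xml"

# Agents named in the group are blocked from every path.
gptbot_allowed = parser.can_fetch("GPTBot", url)
claudebot_allowed = parser.can_fetch("ClaudeBot/1.0", url)

# An agent not named in any group falls through to "allowed" here,
# since this subset has no "*" group.
other_allowed = parser.can_fetch("ExampleBrowserBot", url)
```

Agent matching in `urllib.robotparser` is case-insensitive and ignores anything after a `/` in the agent string, so `ClaudeBot/1.0` is treated as `claudebot`.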

Other Records

Field Value
sitemap https://www.philippebedard.net/en/sitemap.xml
sitemap https://www.philippebedard.net/fr/sitemap.xml
sitemap https://www.philippebedard.net/sitemap.xml
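
Sitemap records are independent of the user-agent groups, so they can be collected with a simple line scan. The following is a minimal sketch, assuming the usual `Field: value` line syntax; the function name is hypothetical and the sample text is rebuilt from the records above, not fetched from the site.

```python
from typing import List


def extract_sitemaps(robots_text: str) -> List[str]:
    """Collect the values of all Sitemap records, in file order."""
    urls = []
    for raw_line in robots_text.splitlines():
        # Drop trailing comments, then split on the first colon only,
        # so the colon inside "https://" stays in the value.
        line = raw_line.split("#", 1)[0].strip()
        field, sep, value = line.partition(":")
        if sep and field.strip().lower() == "sitemap":
            urls.append(value.strip())
    return urls


sample = """\
Sitemap: https://www.philippebedard.net/en/sitemap.xml
Sitemap: https://www.philippebedard.net/fr/sitemap.xml
Sitemap: https://www.philippebedard.net/sitemap.xml
"""
sitemaps = extract_sitemaps(sample)
```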

Comments

  • Source 1: https://github.com/ai-robots-txt/ai.robots.txt/blob/main/robots.txt
  • Source 2: https://darkvisitors.com/docs/robots-txt