samedi.com
robots.txt

Robots Exclusion Standard data for samedi.com

Resource Scan

Scan Details

Site Domain samedi.com
Base Domain samedi.com
Scan Status Ok
Last Scan2024-08-30T01:23:58+00:00
Next Scan 2024-09-29T01:23:58+00:00

Last Scan

Scanned2024-08-30T01:23:58+00:00
URL https://samedi.com/robots.txt
Redirect https://www.samedi.com/robots.txt
Redirect Domain www.samedi.com
Redirect Base samedi.com
Domain IPs 199.60.103.106, 199.60.103.6
Redirect IPs 2600:9000:2024:1400:4:c85c:8840:93a1, 2600:9000:2024:2400:4:c85c:8840:93a1, 2600:9000:2024:4c00:4:c85c:8840:93a1, 2600:9000:2024:6200:4:c85c:8840:93a1, 2600:9000:2024:7800:4:c85c:8840:93a1, 2600:9000:2024:8200:4:c85c:8840:93a1, 2600:9000:2024:8600:4:c85c:8840:93a1, 2600:9000:2024:b200:4:c85c:8840:93a1, 3.164.85.118, 3.164.85.18, 3.164.85.31, 3.164.85.36
Response IP 18.165.171.62
Found Yes
Hash ce586b6fbe4a73e51d9d23b097b1a0e2f17d50a2c431e768f7e6109cbb4886bd
SimHash 635819562e91

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/

Other Records

Field Value
sitemap https://www.samedi.com/sitemaps-1-sitemap.xml
sitemap https://www.samedi.com/en/sitemaps-1-sitemap.xml

Comments

  • robots.txt for https://www.samedi.com/
  • live - don't allow web crawlers to index cpresources/ or vendor/