sitedemploi.com
robots.txt

Robots Exclusion Standard data for sitedemploi.com

Resource Scan

Scan Details

Site Domain sitedemploi.com
Base Domain sitedemploi.com
Scan Status Failed
Failure ReasonScan timed out.
Last Scan2026-01-20T22:34:31+00:00
Next Scan 2026-02-19T22:34:31+00:00

Last Successful Scan

Scanned2025-12-22T19:17:05+00:00
URL https://sitedemploi.com/robots.txt
Redirect https://www.sitedemploi.com/robots.txt
Redirect Domain www.sitedemploi.com
Redirect Base sitedemploi.com
Domain IPs 142.44.144.140
Redirect IPs 142.44.144.140
Response IP 142.44.144.140
Found Yes
Hash 297a8b81c7e252f7eaf3826135f7e625afb49d62d3f09d27589081af4d4f64e1
SimHash b91c1280c3cb

Groups

*

Rule Path
Disallow /admin/
Disallow /App_Data/
Disallow /bin/
Disallow /ckeditor/
Disallow /ckfinder/
Disallow /test-mail/
Disallow /JpegImage.aspx

admantx

Rule Path
Disallow /

crystalsemantics

Rule Path
Disallow /

spbot

Rule Path
Disallow /

Comments

  • robots.txt -http://www.sitedemploi.com
  • ------------------------
  • Start of Bot Blocking
  • ------------------------
  • Block Admantx Bot
  • ------------------------
  • Block XaxisSemanticsClassifier Bot
  • ------------------------
  • ------------------------
  • Block OpenLinkProfiler.org
  • ------------------------
  • ------------------------