cxem.net
robots.txt

Robots Exclusion Standard data for cxem.net

Resource Scan

Scan Details

Site Domain cxem.net
Base Domain cxem.net
Scan Status Ok
Last Scan 2024-11-16T02:00:56+00:00
Next Scan 2024-11-23T02:00:56+00:00

Last Scan

Scanned 2024-11-16T02:00:56+00:00
URL https://cxem.net/robots.txt
Domain IPs 78.46.106.238
Response IP 78.46.106.238
Found Yes
Hash 8d86f25178640123b315358b02bbf8fb10534edf119a98e88e219e39a03308cf
SimHash 6918b843ca30

Groups

*

Rule Path
Disallow /forum/
Disallow /news/
Disallow /shop/
Disallow /search.php
Disallow /articles_elparts.php
Disallow *.php/
Disallow /images/banners/
Disallow /articles.php
Disallow /getPDF.php
Disallow /elshop.php
Disallow /electronic_news/electronic_news.php
Disallow /tags/
Disallow /cms/login/
Disallow /cms/profile/
Disallow /cms/elshop/
Disallow /profile/
Disallow /cms/showdrafts/

gptbot
claudebot
claude-web
ccbot
applebot-extended
facebookbot
meta-externalagent
diffbot
perplexitybot
omgili
omgilibot
imagesiftbot
bytespider
amazonbot
youbot

Rule Path
Disallow /
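
As a rough illustration of how the two groups above behave, here is a minimal sketch that feeds a partial reconstruction of them to Python's standard `urllib.robotparser`. The `ROBOTS_TXT` string is an assumption rebuilt from the rule table, not the site's actual file, and it includes only a few of the listed paths and agents. The `*.php/` rule is left out because this parser matches by plain path prefix and does not interpret wildcards. `ExampleBot` and the helper `allowed` are hypothetical names used only for the demonstration.

```python
from urllib.robotparser import RobotFileParser

# Partial, hand-made reconstruction of the two groups listed above.
# This is NOT the real file contents, only an approximation for illustration.
ROBOTS_TXT = """\
User-agent: *
Disallow: /forum/
Disallow: /news/
Disallow: /shop/
Disallow: /search.php

User-agent: gptbot
User-agent: claudebot
User-agent: ccbot
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the reconstructed rules."""
    return parser.can_fetch(agent, "https://cxem.net" + path)


# "ExampleBot" is a hypothetical crawler that falls under the "*" group.
print(allowed("ExampleBot", "/forum/"))  # False: /forum/ is disallowed for everyone
print(allowed("ExampleBot", "/"))        # True: the site root is not listed
print(allowed("GPTBot", "/"))            # False: the named group disallows everything
```

Under these assumptions, a crawler named in the second group is blocked from the entire site, while any other crawler is blocked only from the listed paths.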

Other Records

Field Value
sitemap https://cxem.net/sitemap.xml

Warnings

  • `clean-param` is not a known field.
  • `host` is not a known field.