necf.org.my
robots.txt

Robots Exclusion Standard data for necf.org.my

Resource Scan

Scan Details

Site Domain necf.org.my
Base Domain necf.org.my
Scan Status Ok
Last Scan2024-09-13T03:08:47+00:00
Next Scan 2024-10-13T03:08:47+00:00

Last Scan

Scanned2024-09-13T03:08:47+00:00
URL https://www.necf.org.my/robots.txt
Domain IPs 108.157.254.121, 108.157.254.55, 108.157.254.66, 108.157.254.75, 2600:9000:2753:1600:8:bed1:9940:93a1, 2600:9000:2753:4a00:8:bed1:9940:93a1, 2600:9000:2753:600:8:bed1:9940:93a1, 2600:9000:2753:6600:8:bed1:9940:93a1, 2600:9000:2753:7a00:8:bed1:9940:93a1, 2600:9000:2753:e200:8:bed1:9940:93a1, 2600:9000:2753:e400:8:bed1:9940:93a1, 2600:9000:2753:ec00:8:bed1:9940:93a1
Response IP 108.157.254.55
Found Yes
Hash a487280874721e140797968737ca8e3d9d125d163c365ec9e49f6650bc4e7f92
SimHash 2d40bd7a4a22

Groups

*

Rule Path
Disallow /activeedit_plugins/
Disallow /admin/
Disallow /aeimages/
Disallow /aspnet_client/
Disallow /component/
Disallow /documentstore/
Disallow /dropdownmenu/
Disallow /emailattachment/
Disallow /fckeditor/
Disallow /general/
Disallow /images/
Disallow /inc/
Disallow /js/
Disallow /member/
Disallow /player/
Disallow /searchcollection/
Disallow /searchengine/
Disallow /subscribe/
Disallow /survey/
Disallow /tempstore/
Disallow /thematic/
Disallow /updatesubscription/
Disallow /workspace/
Disallow /video/

Other Records

Field Value
crawl-delay 1

Comments

  • Comments - we set most CMS directories to disallow kllam