simoom.net
robots.txt

Robots Exclusion Standard data for simoom.net

Resource Scan

Scan Details

Site Domain simoom.net
Base Domain simoom.net
Scan Status Ok
Last Scan2024-06-14T01:20:39+00:00
Next Scan 2024-07-14T01:20:39+00:00

Last Scan

Scanned2024-06-14T01:20:39+00:00
URL https://simoom.net/robots.txt
Redirect https://www.simoom.net/robots.txt
Redirect Domain www.simoom.net
Redirect Base simoom.net
Domain IPs 2403:3a00:201:1e:49:212:207:67, 49.212.207.67
Redirect IPs 2403:3a00:201:1e:49:212:207:67, 49.212.207.67
Response IP 49.212.207.67
Found Yes
Hash 65b6900d1d41de38ea5f5fcba7ef7699ce5becd99e26ab0a6ade8f95fa8e44bc
SimHash 015eff828183

Groups

http://www.almaden.ibm.com/cs/crawler

Rule Path
Disallow /

appie

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

baiduspider+

Rule Path
Disallow /

baidumobaider

Rule Path
Disallow /

baiduimagespider

Rule Path
Disallow /

charlotte

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

googlebot-image

Rule Path
Disallow /

hatena-mobile-gateway

Rule Path
Disallow /

megalodon

Rule Path
Disallow /

searchpreview

Rule Path
Disallow /

psbot

Rule Path
Disallow /

scspider

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

yahoo-mmcrawler

Rule Path
Disallow /

pockey-gethtml

Rule Path
Disallow /

*

Rule Path
Disallow /common/
Disallow /maeres/
Disallow /warp/
Disallow /*.gif$
Disallow /*.jpeg$
Disallow /*.jpg$
Disallow /*.png$
Disallow /*.bmp$
Disallow /*.mid$
Disallow /*.lzh$
Disallow /*.zip$
Disallow /*.cab$
Disallow /*.cgi$
Allow /common/*.css$