sumika.me
robots.txt

Robots Exclusion Standard data for sumika.me

Resource Scan

Scan Details

Site Domain sumika.me
Base Domain sumika.me
Scan Status Ok
Last Scan 2024-11-01T18:25:20+00:00
Next Scan 2024-11-08T18:25:20+00:00

Last Scan

Scanned 2024-11-01T18:25:20+00:00
URL https://sumika.me/robots.txt
Domain IPs 13.33.30.124, 13.33.30.57, 13.33.30.65, 13.33.30.72
Response IP 13.33.30.57
Found Yes
Hash 0b9a9cbd1e3b635e2a4c24b3efc4a313509bdcb0bbd9ff40c89920fa06d149b6
SimHash 684591714ff4
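
The Hash value above is 64 hexadecimal digits long, which is consistent with a SHA-256 digest. The report does not state the algorithm or exactly which bytes were hashed, so the following is only a sketch under the assumption that the hash is SHA-256 over the raw response body. The helper names `content_hash` and `matches_recorded` are hypothetical.

```python
import hashlib

# Hash value copied from the scan record above.
RECORDED_HASH = "0b9a9cbd1e3b635e2a4c24b3efc4a313509bdcb0bbd9ff40c89920fa06d149b6"


def content_hash(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()


def matches_recorded(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """Check whether a fetched body hashes to the recorded value."""
    return content_hash(body) == recorded.lower()
```

If the assumption holds, a mismatch for a freshly fetched file would suggest that robots.txt has changed since the scan.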

Groups

mj12bot

Rule Path
Disallow /

*

Rule Path
Disallow /u/
Disallow /p/
Disallow /20739468/
Disallow /pro_profiles/*/fb_link
Disallow /pro_profiles/*/hp_link
Disallow /search/pro/rm/*/pa/*/p/*
Disallow /search/pro/rm/*/pa/*/pp/*/p/*
Disallow /pros/area/*/p/*
Disallow /pros/role/*/p/*
Disallow /jobs/search/?page=*
Disallow /jobs/search?page=*
Disallow /seek_advices/search/*
Disallow /*.pdf$
Allow /u/photo/*/clip_count
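
To illustrate how a crawler might evaluate the two groups above, here is a minimal sketch of longest-match rule resolution as described in RFC 9309, with `*` treated as a wildcard and a trailing `$` as an end-of-path anchor. The rules are transcribed from this scan. The names `GROUPS`, `pattern_to_regex`, and `is_allowed` are hypothetical, and real crawlers may differ in details such as percent-encoding and user-agent matching.

```python
import re

# Rules transcribed from the scan above, keyed by lowercase user-agent token.
GROUPS = {
    "mj12bot": [("disallow", "/")],
    "*": [
        ("disallow", "/u/"),
        ("disallow", "/p/"),
        ("disallow", "/20739468/"),
        ("disallow", "/pro_profiles/*/fb_link"),
        ("disallow", "/pro_profiles/*/hp_link"),
        ("disallow", "/search/pro/rm/*/pa/*/p/*"),
        ("disallow", "/search/pro/rm/*/pa/*/pp/*/p/*"),
        ("disallow", "/pros/area/*/p/*"),
        ("disallow", "/pros/role/*/p/*"),
        ("disallow", "/jobs/search/?page=*"),
        ("disallow", "/jobs/search?page=*"),
        ("disallow", "/seek_advices/search/*"),
        ("disallow", "/*.pdf$"),
        ("allow", "/u/photo/*/clip_count"),
    ],
}


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex matched from the path start."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal segments; each '*' becomes '.*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(user_agent, path):
    """Return True if `path` may be fetched by `user_agent` under GROUPS."""
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    best_len = -1
    allowed = True  # No matching rule means the path is allowed.
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            # The longest pattern wins; on a tie, allow is preferred.
            if len(pattern) > best_len or (len(pattern) == best_len and kind == "allow"):
                best_len = len(pattern)
                allowed = kind == "allow"
    return allowed
```

Under this sketch, `/u/photo/123/clip_count` is allowed for a generic crawler because the Allow pattern is longer than `Disallow /u/`, while any path is disallowed for `mj12bot`.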

Other Records

Field Value
sitemap https://sumika.me/sitemap.xml.gz