insideiim.com
robots.txt

Robots Exclusion Standard data for insideiim.com

Resource Scan

Scan Details

Site Domain insideiim.com
Base Domain insideiim.com
Scan Status Ok
Last Scan 2024-09-22T02:42:23+00:00
Next Scan 2024-10-22T02:42:23+00:00

Last Scan

Scanned 2024-09-22T02:42:23+00:00
URL https://insideiim.com/robots.txt
Domain IPs 52.84.229.112, 52.84.229.121, 52.84.229.46, 52.84.229.49
Response IP 52.84.229.112
Found Yes
Hash 1e4dae2b0cbb93e91af86b648a4cb24007a4a857f40f81e3ed1779a0a12dc771
SimHash 40400b32cfb0

Groups

*

Rule Path
Allow /profile/*.css$
Allow /profile/*.css?
Allow /profile/*.js$
Allow /profile/*.js?
Allow /profile/*.gif
Allow /profile/*.jpg
Allow /profile/*.jpeg
Allow /profile/*.png
Disallow /*partial/
Disallow /*?s=
Disallow /*?p=
Disallow /*?cx=
Disallow /search_gcse/*
Disallow /search?q=*
Disallow /auth/*
Disallow /write-a-story?post_id=*
Disallow /author/stories/drafts
Disallow /write-a-story
Disallow /goals/my-goals
Disallow /community/my-questions
Disallow /notification/list
Disallow /*mailer%3D*
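
The rules above can be checked against a URL path with a small matcher. What follows is a minimal sketch, not the scanner's or any crawler's actual implementation. It assumes the common matching conventions: `*` matches any run of characters, a trailing `$` anchors the end of the path, every pattern is otherwise a prefix match, the longest matching pattern wins, and Allow wins a tie. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical, and percent-encoding is not normalized, so `/*mailer%3D*` is compared literally.

```python
import re

# Rules copied from the "*" group listed above, in file order.
RULES = [
    ("allow", "/profile/*.css$"),
    ("allow", "/profile/*.css?"),
    ("allow", "/profile/*.js$"),
    ("allow", "/profile/*.js?"),
    ("allow", "/profile/*.gif"),
    ("allow", "/profile/*.jpg"),
    ("allow", "/profile/*.jpeg"),
    ("allow", "/profile/*.png"),
    ("disallow", "/*partial/"),
    ("disallow", "/*?s="),
    ("disallow", "/*?p="),
    ("disallow", "/*?cx="),
    ("disallow", "/search_gcse/*"),
    ("disallow", "/search?q=*"),
    ("disallow", "/auth/*"),
    ("disallow", "/write-a-story?post_id=*"),
    ("disallow", "/author/stories/drafts"),
    ("disallow", "/write-a-story"),
    ("disallow", "/goals/my-goals"),
    ("disallow", "/community/my-questions"),
    ("disallow", "/notification/list"),
    ("disallow", "/*mailer%3D*"),
]


def pattern_to_regex(path_pattern):
    """Translate one robots.txt path pattern into a compiled regex."""
    anchored = path_pattern.endswith("$")
    body = path_pattern[:-1] if anchored else path_pattern
    # Escape everything literally, then turn each "*" into ".*".
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + (r"\Z" if anchored else ""))


def is_allowed(url_path, rules=RULES):
    """Return True if url_path (path plus query string) may be crawled."""
    best_len = -1
    best_allow = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        # re.match anchors at the start, giving prefix semantics.
        if pattern_to_regex(pattern).match(url_path):
            is_allow = kind == "allow"
            length = len(pattern)
            # Longest pattern wins; on equal length, Allow wins.
            if length > best_len or (length == best_len and is_allow):
                best_len, best_allow = length, is_allow
    return best_allow
```

Under these assumptions, `is_allowed("/auth/login")` and `is_allowed("/?s=iim")` return False, while `is_allowed("/profile/main.css")` returns True. A path such as `/profile/partial/app.js` matches both `Disallow /*partial/` and `Allow /profile/*.js$`; the Allow pattern is longer, so the path is treated as crawlable.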

Comments

  • CSS, JS, Images
  • Paths (for clean URLs)