in.ixl.com
robots.txt

Robots Exclusion Standard data for in.ixl.com

Resource Scan

Scan Details

Site Domain in.ixl.com
Base Domain ixl.com
Scan Status Ok
Last Scan2024-11-04T22:07:20+00:00
Next Scan 2024-11-18T22:07:20+00:00

Last Scan

Scanned2024-11-04T22:07:20+00:00
URL https://in.ixl.com/robots.txt
Domain IPs 104.16.224.31, 104.16.240.30, 104.18.128.25, 104.18.175.254, 104.18.191.254
Response IP 104.18.128.25
Found Yes
Hash ffbadf56f89e946e642bf6e271429a0dff1832244f961ee7e5403da263ae731c
SimHash 68dcd381adf2

Groups

*

Rule Path
Disallow /servlets
Disallow /rotate_text
Disallow /printstandards/
Disallow /certificate/
Disallow /sharepage
Disallow /practice/tally
Disallow /practice/cease
Disallow /practice/summary
Disallow /practice/cutoff
Disallow /practice/smartscoreToolTip
Disallow /practice/smartscore
Disallow /practice/tts
Disallow /practice/diagnose/
Disallow /practice-help
Disallow /florida
Disallow /forgot/
Disallow /forward_newsletter
Disallow /signin/subaccount
Disallow /signin/ajax
Disallow /signin/ajax-homepage
Disallow /signin/silent
Disallow /signin/help
Disallow /signin/dql
Disallow /diagnostic/viewQuestionsLog
Disallow /_test/
Disallow /resources/webinar-schedule?
Disallow /begin

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.ixl.com/sitemap_index.xml

Comments

  • -----------------------------------------------------------------------------
  • Areas that search robots should avoid
  • (c) 2011 IXL Learning. All rights reserved.
  • created by jkent on 8 Mar 2002
  • Site-friendly search robots use this file to determine where _not_
  • to go. Some URL spaces are simply counterproductive.
  • -----------------------------------------------------------------------------