emanualonline.com
robots.txt

Robots Exclusion Standard data for emanualonline.com

Resource Scan

Scan Details

Site Domain emanualonline.com
Base Domain emanualonline.com
Scan Status Ok
Last Scan2024-11-15T13:14:50+00:00
Next Scan 2024-11-22T13:14:50+00:00

Last Scan

Scanned2024-11-15T13:14:50+00:00
URL https://emanualonline.com/robots.txt
Redirect https://www.emanualonline.com/robots.txt
Redirect Domain www.emanualonline.com
Redirect Base emanualonline.com
Domain IPs 149.28.56.39
Redirect IPs 45.42.40.185
Response IP 108.181.57.149
Found Yes
Hash 496a5ec4bf54d0ceb52ba388b4e7055428d016b0a7748fb2fc2cba44a48555fd
SimHash 4121f9034dd0

Groups

*

Rule Path
Allow /
Disallow /index.php/
Disallow /checkout/
Disallow /app/
Disallow /lib/
Disallow /*.php$
Disallow /pkginfo/
Disallow /report/
Disallow /var/
Disallow /catalogsearch/
Disallow /customer/
Disallow /sendfriend/
Disallow /review/
Disallow /magento/
Disallow /catalog/product/view/id/
Disallow /*?product_list_order*
Disallow /*?searchtype*
Disallow /*?abquestionid*
Disallow /*?product_list_dir*
Disallow /*?currency=*
Disallow /*?wordfence=*
Disallow /*/feed/*
Disallow /*SID%3D
Disallow /*price%3D*
Disallow /*size%3D*
Disallow /*cat%3D*
Disallow /*?sid=*
Disallow /*?SID=*
Disallow /*?id=*
Disallow /*?ID=*
Disallow /wishlist/
Disallow /cdn-cgi/
Disallow /*referer*
Disallow /blog/cgi-bin
Disallow /blog/wp-admin/
Disallow /blog/*/embed
Disallow /blog/*/xmlrpc.php
Disallow *openstat%3D

googlebot

Rule Path
Disallow /education/*.html
Disallow /Education/*.html

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.emanualonline.com/sitemap.xml
sitemap https://www.emanualonline.com/blog/sitemap_index.xml
sitemap https://www.emanualonline.com/blog/news-sitemap.xml
sitemap https://www.emanualonline.com/qa/sitemapindex.xml

Comments

  • Block /education section for Googlebot