netbhet.com
robots.txt

Robots Exclusion Standard data for netbhet.com

Resource Scan

Scan Details

Site Domain netbhet.com
Base Domain netbhet.com
Scan Status Ok
Last Scan2024-10-29T10:05:27+00:00
Next Scan 2024-11-28T10:05:27+00:00

Last Scan

Scanned2024-10-29T10:05:27+00:00
URL https://netbhet.com/robots.txt
Redirect https://www.netbhet.com/robots.txt
Redirect Domain www.netbhet.com
Redirect Base netbhet.com
Domain IPs 199.34.228.77
Redirect IPs 199.34.228.77
Response IP 199.34.228.77
Found Yes
Hash e2c45f90639fe9b6d447708a9db26088eea21967bbc8977227b1d40fc10b830c
SimHash 2254541622d3

Groups

nerdybot

Rule Path
Disallow /

dotbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

*

Rule Path
Disallow /ajax/
Disallow /apps/
Disallow /test-home-page.html
Disallow /edp.html
Disallow /ganeshaoffer.html
Disallow /thanks.html
Disallow /offers1.html
Disallow /sandbox.html
Disallow /https%3A//learn.netbhet.com/blog
Disallow /facebook-marketing-marathi-course.html
Disallow /dcresources.html
Disallow /18/
Disallow /dcresources/
Disallow /dcresources
Disallow /tsw.html
Disallow /loa-live.html
Disallow /pd-webinar-live.html
Disallow /diwali-thanks.html
Disallow /pid-thanks.html
Disallow /qcdemo.html

Other Records

Field Value
sitemap https://www.netbhet.com/sitemap.xml