jeewangarg.com
robots.txt

Robots Exclusion Standard data for jeewangarg.com

Resource Scan

Scan Details

Site Domain jeewangarg.com
Base Domain jeewangarg.com
Scan Status Ok
Last Scan2024-11-05T21:32:07+00:00
Next Scan 2024-11-12T21:32:07+00:00

Last Scan

Scanned2024-11-05T21:32:07+00:00
URL https://jeewangarg.com/robots.txt
Redirect https://www.jeewangarg.com/robots.txt
Redirect Domain www.jeewangarg.com
Redirect Base jeewangarg.com
Domain IPs 190.92.174.35
Redirect IPs 190.92.174.35
Response IP 190.92.174.35
Found Yes
Hash 1866959b6dbd0cb288af0bbbadd1d98d982ea33ed44f2bb29c36790f40351569
SimHash 800dee23167f

Groups

*

Rule Path
Allow /*.js
Allow /*.css
Allow /*.jpg
Allow /*.png

httrack disallow /
httrack disallow: /
netcaptor disallow /
netcaptor disallow: /
offline explorer disallow /
offline explorer disallow: /
spiderku/0.9 disallow /
spiderku/0.9 disallow: /
steeler disallow /
steeler disallow: /
webcopier v3.3 disallow /
webcopier v3.3 disallow: /
webcopier v3.2a disallow /
webcopier v3.2a disallow: /
webcopier disallow /
webcopier disallow: /
webcrawler disallow: /
web downloader/4.9 disallow /
web downloader/4.9 disallow: /
web downloader/5.8 disallow /
web downloader/5.8 disallow: /
webgather 3.0 disallow /
webgather 3.0 disallow: /
webstripper/2.56 disallow /
webstripper/2.56 disallow: /
webzip/3.65 disallow /
webzip/3.65 disallow: /
webzip disallow /
webzip disallow: /
wget disallow /
wget disallow: /
zao disallow /
zao disallow: /
zeus 2.6 disallow /
zeus 2.6 disallow: /

No rules defined. All paths allowed.

Other Records

Field Value
sitemap https://www.jeewangarg.com/sitemap.xml
sitemap https://www.jeewangarg.com/post-sitemap.xml

Warnings

  • 2 invalid lines.