webcomindia.net
robots.txt

Robots Exclusion Standard data for webcomindia.net

Resource Scan

Scan Details

Site Domain webcomindia.net
Base Domain webcomindia.net
Scan Status Ok
Last Scan 2026-01-07T20:12:14+00:00
Next Scan 2026-02-06T20:12:14+00:00

Last Scan

Scanned 2026-01-07T20:12:14+00:00
URL https://webcomindia.net/robots.txt
Domain IPs 198.143.158.45
Response IP 198.143.158.45
Found Yes
Hash 608f1915d109697a9c7088131145841f9d291a03069d5a811e39329746dcd0a5
SimHash 290d8900cdd1

Groups

mediapartners-google*

Rule Path
Disallow (empty)

scooter

Rule Path
Disallow (empty)

fast-webcrawler

Rule Path
Disallow (empty)

googlebot

Rule Path
Disallow (empty)

slurp

Rule Path
Disallow (empty)

lycos_spider_(t-rex)

Rule Path
Disallow (empty)

*

Rule Path
Disallow (empty)

Other Records

Field Value
sitemap https://www.webcomindia.net/sitemap.xml

Comments

  • FULL access (Alta Vista)
  • FULL access (FAST/AllTheWeb)
  • FULL access (Google)
  • FULL access (Inktomi)
  • FULL access (Lycos)
  • FULL access (All Spiders)
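
Every group above carries a single Disallow rule with no path, which under the Robots Exclusion Standard places no restriction on the crawler, in line with the "FULL access" comments. The sketch below illustrates this with Python's standard urllib.robotparser. The ROBOTS_TXT text is an approximate reconstruction from the groups, comments, and sitemap record listed in this report, not the verbatim file (its hash will not match the one above), and the test URLs and the SomeOtherBot agent name are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumption: reconstructed from the report's groups and comments.
# The real file's ordering, casing, and whitespace may differ.
ROBOTS_TXT = """\
User-agent: mediapartners-google*
Disallow:

# FULL access (Alta Vista)
User-agent: scooter
Disallow:

# FULL access (FAST/AllTheWeb)
User-agent: fast-webcrawler
Disallow:

# FULL access (Google)
User-agent: googlebot
Disallow:

# FULL access (Inktomi)
User-agent: slurp
Disallow:

# FULL access (Lycos)
User-agent: lycos_spider_(t-rex)
Disallow:

# FULL access (All Spiders)
User-agent: *
Disallow:

Sitemap: https://www.webcomindia.net/sitemap.xml
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text held in memory, without any network access."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# An empty Disallow value matches nothing, so every agent may fetch any path.
for agent in ("googlebot", "slurp", "SomeOtherBot"):
    allowed = parser.can_fetch(agent, "https://webcomindia.net/some/page.html")
    print(agent, allowed)

# site_maps() requires Python 3.8 or later.
print(parser.site_maps())
```

Named agents such as googlebot match their own group, while any unlisted agent falls through to the `*` group; both paths end at an empty Disallow, so the result is the same.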