webcomindia.net
robots.txt

Robots Exclusion Standard data for webcomindia.net

Archived Snapshots

Resource Scan

Scan Details

Site Domain	webcomindia.net
Base Domain	webcomindia.net
Scan Status	Ok
Last Scan	2026-01-07T20:12:14+00:00
Next Scan	2026-02-06T20:12:14+00:00

Last Scan

Scanned	2026-01-07T20:12:14+00:00
URL	https://webcomindia.net/robots.txt
Domain IPs	198.143.158.45
Response IP	198.143.158.45
Found	Yes
Hash	608f1915d109697a9c7088131145841f9d291a03069d5a811e39329746dcd0a5
SimHash	290d8900cdd1

Groups

mediapartners-google*

Rule	Path
Disallow

Rule

Path

Disallow

scooter

Rule	Path
Disallow

Rule

Path

Disallow

fast-webcrawler

Rule	Path
Disallow

Rule

Path

Disallow

googlebot

Rule	Path
Disallow

Rule

Path

Disallow

slurp

Rule	Path
Disallow

Rule

Path

Disallow

lycos_spider_(t-rex)

Rule	Path
Disallow

Rule

Path

Disallow

*

Rule	Path
Disallow

Rule

Path

Disallow

Back to top

Other Records

Field	Value
sitemap	https://www.webcomindia.net/sitemap.xml

Field

Value

sitemap

https://www.webcomindia.net/sitemap.xml

Back to top

Comments

FULL access (Alta Vista)
FULL access (FAST/AllTheWeb)
FULL access (Google)
FULL access (Inktomi)
FULL access (Lycos)
FULL access (All Spiders)

Back to top

webcomindia.netrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

mediapartners-google*

scooter

fast-webcrawler

googlebot

slurp

lycos_spider_(t-rex)

*

Other Records

Comments

webcomindia.net
robots.txt