in.pearson.com
robots.txt

Robots Exclusion Standard data for in.pearson.com

Resource Scan

Scanned	2025-11-29T11:24:39+00:00
URL	https://in.pearson.com/robots.txt
Domain IPs	23.41.19.159
Response IP	23.39.5.176
Found	Yes
Hash	2af68ee838c9eebfb6f9d8a47555674c1c10a987daa82b4bd19bfc8efc23ca93
SimHash	e0514445c593

Rule	Path
Disallow	/en/pdc-new-en

Rule

Path

Disallow

/en/pdc-new-en

Back to top

Field	Value
sitemap	https://in.pearson.com/sitemap.xml

Field

Value

sitemap

https://in.pearson.com/sitemap.xml

Back to top