webdunia.com
robots.txt

Robots Exclusion Standard data for webdunia.com

Resource Scan

Scan Details

Site Domain	webdunia.com
Base Domain	webdunia.com
Scan Status	Ok
Last Scan	2024-06-05T23:41:20+00:00
Next Scan	2024-06-12T23:41:20+00:00

Last Scan

Scanned	2024-06-05T23:41:20+00:00
URL	https://webdunia.com/robots.txt
Redirect	https://hindi.webdunia.com/robots.txt
Redirect Domain	hindi.webdunia.com
Redirect Base	webdunia.com
Domain IPs	3.108.168.170
Redirect IPs	104.17.113.3, 104.17.114.3, 2606:4700::6811:7103, 2606:4700::6811:7203
Response IP	104.17.113.3
Found	Yes
Hash	aff454126d4844b729d2517469e2561cc4b5efb3b91e904e630a482f738f0447
SimHash	4946cb640553

Groups

*

Rule	Path
Allow	/
Disallow	/includes
Disallow	/include
Disallow	/adinnovations
Disallow	/sports/cricket/scorecard/cricketscores
Disallow	/1031084
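
The rules in the `*` group can be checked against a path with a small matcher. The following is a minimal sketch, assuming the longest-match precedence described in RFC 9309 (the most specific matching rule wins, and Allow wins a tie) and plain prefix matching only. The function name `is_allowed` and the test paths are illustrative and do not come from the scan.

```python
# Rules copied from the "*" group listed above.
RULES = [
    ("allow", "/"),
    ("disallow", "/includes"),
    ("disallow", "/include"),
    ("disallow", "/adinnovations"),
    ("disallow", "/sports/cricket/scorecard/cricketscores"),
    ("disallow", "/1031084"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    Assumption: longest-match precedence, Allow wins ties, and
    plain prefix matching (no '*' or '$' wildcard support).
    """
    best_len = -1
    allowed = True  # a path that matches no rule is allowed
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        if len(prefix) > best_len or (len(prefix) == best_len and kind == "allow"):
            best_len = len(prefix)
            allowed = kind == "allow"
    return allowed


# Hypothetical paths, chosen only to exercise the rules.
for p in ["/", "/includes/site.js", "/included-page", "/sports/cricket"]:
    print(p, is_allowed(p))
```

Under prefix matching, `Disallow /include` already covers every path that `Disallow /includes` would match, so the second rule is redundant in this sketch. A parser that applies rules in file order instead of by longest match would stop at `Allow /` and permit every path, which is why the precedence model matters for this group.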

googlebot-news

No rules defined. All paths allowed.

Other Records

Field	Value
sitemap	https://hindi.webdunia.com/sitemaps/sitemap.xml
sitemap	https://hindi.webdunia.com/sitemaps/googlesitemap.xml

Warnings

`noindex` is not a known field.
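
A warning like this can come from a check of each field name against a list of recognized fields. The following is a rough sketch of such a check, not the scanner's actual logic. The `KNOWN_FIELDS` set, the function name `find_unknown_fields`, and the `SAMPLE` text (including the `Noindex` value) are assumptions made for illustration; the real file contents are not shown in this report.

```python
# Assumption: the set of fields the checker recognizes.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap"}


def find_unknown_fields(text):
    """Return unrecognized field names, lowercased, in first-seen order."""
    unknown = []
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line or ":" not in line:
            continue
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS and field not in unknown:
            unknown.append(field)
    return unknown


# Hypothetical robots.txt text; the Noindex path is invented.
SAMPLE = """\
User-agent: *
Allow: /
Noindex: /example-path
Sitemap: https://hindi.webdunia.com/sitemaps/sitemap.xml
"""

for name in find_unknown_fields(SAMPLE):
    print(f"`{name}` is not a known field.")
```

Splitting on the first colon only keeps the `https://` in the sitemap URL from being mistaken for a field separator.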
