worlddata.info
robots.txt

Robots Exclusion Standard data for worlddata.info

Resource Scan

Scan Details

Site Domain worlddata.info
Base Domain worlddata.info
Scan Status Ok
Last Scan 2024-09-21T16:47:23+00:00
Next Scan 2024-09-28T16:47:23+00:00

Last Scan

Scanned 2024-09-21T16:47:23+00:00
URL https://www.worlddata.info/robots.txt
Domain IPs 13.35.18.114, 13.35.18.125, 13.35.18.18, 13.35.18.73, 2600:9000:20c7:1200:1:5ab3:6040:93a1, 2600:9000:20c7:4400:1:5ab3:6040:93a1, 2600:9000:20c7:4e00:1:5ab3:6040:93a1, 2600:9000:20c7:7e00:1:5ab3:6040:93a1, 2600:9000:20c7:9c00:1:5ab3:6040:93a1, 2600:9000:20c7:a00:1:5ab3:6040:93a1, 2600:9000:20c7:b000:1:5ab3:6040:93a1, 2600:9000:20c7:e400:1:5ab3:6040:93a1
Response IP 13.35.18.73
Found Yes
Hash 43cf605e1f2520c1046c3d75db7ed82e2d974b47c17c9139eff0b67161c04ecc
SimHash 4810da728b13

Groups

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

siteliner

Rule Path
Disallow /

zoominfobot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

*

No rules defined. All paths allowed.
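
To illustrate how the groups above would be evaluated by a crawler, here is a minimal sketch using Python's standard urllib.robotparser. The robots.txt text is reconstructed from the listed groups and the sitemap record, so its exact wording, casing, and ordering are assumptions rather than the contents of the live file. The helper names (build_robots_txt, make_parser) and the test agent "Googlebot" are hypothetical and chosen only for illustration.

```python
from urllib.robotparser import RobotFileParser

# User agents that the scan reports as fully disallowed ("Disallow /").
BLOCKED_AGENTS = [
    "gptbot", "chatgpt-user", "ccbot", "perplexitybot",
    "siteliner", "zoominfobot", "bytespider",
]

SITEMAP_URL = "https://www.worlddata.info/sitemap.xml"


def build_robots_txt(agents, sitemap_url):
    """Rebuild an approximate robots.txt from the scan data (an assumption,
    not the verbatim file served by the site)."""
    lines = []
    for agent in agents:
        # One group per agent, each blocking the whole site.
        lines += [f"User-agent: {agent}", "Disallow: /", ""]
    # No "*" group is emitted: agents without a matching group are allowed.
    lines.append(f"Sitemap: {sitemap_url}")
    return "\n".join(lines)


def make_parser(robots_txt):
    """Parse robots.txt text without any network access."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


parser = make_parser(build_robots_txt(BLOCKED_AGENTS, SITEMAP_URL))

# A listed agent is blocked everywhere; matching is case-insensitive.
print(parser.can_fetch("GPTBot", "https://www.worlddata.info/"))     # False
# An agent with no matching group falls through to "all paths allowed".
print(parser.can_fetch("Googlebot", "https://www.worlddata.info/"))  # True
# site_maps() requires Python 3.8 or later.
print(parser.site_maps())
```

To check the live file instead of the reconstruction, call set_url("https://www.worlddata.info/robots.txt") followed by read() on a RobotFileParser instance; the result may differ from this scan if the file has changed since 2024-09-21.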

Other Records

Field Value
sitemap https://www.worlddata.info/sitemap.xml