whitepages.com
robots.txt

Robots Exclusion Standard data for whitepages.com

Resource Scan

Scan Details

Site Domain whitepages.com
Base Domain whitepages.com
Scan Status Ok
Last Scan 2024-05-04T14:04:05+00:00
Next Scan 2024-05-11T14:04:05+00:00

Last Scan

Scanned 2024-05-04T14:04:05+00:00
URL https://whitepages.com/robots.txt
Redirect https://www.whitepages.com/robots.txt
Redirect Domain www.whitepages.com
Redirect Base whitepages.com
Domain IPs 13.226.210.19, 13.226.210.51, 13.226.210.58, 13.226.210.73, 2600:9000:201d:1400:9:2c5d:cdc0:93a1, 2600:9000:201d:4a00:9:2c5d:cdc0:93a1, 2600:9000:201d:4e00:9:2c5d:cdc0:93a1, 2600:9000:201d:5000:9:2c5d:cdc0:93a1, 2600:9000:201d:9e00:9:2c5d:cdc0:93a1, 2600:9000:201d:d400:9:2c5d:cdc0:93a1, 2600:9000:201d:e600:9:2c5d:cdc0:93a1, 2600:9000:201d:e800:9:2c5d:cdc0:93a1
Redirect IPs 104.18.41.32, 172.64.146.224, 2606:4700:4400::6812:2920, 2606:4700:4400::ac40:92e0
Response IP 104.18.41.32
Found Yes
Hash a7bfc8756479e5dd339521ad6568994e65b4119819465f5bc0a7cfba48840721
SimHash 2c8ad872d8a0

Groups

*

Rule Path
Allow /ads.txt
Allow /name/*/*/P
Allow /name/*/*
Allow /name/*
Allow /address/*/*/
Allow /phone/US
Allow /phone/us
Allow /phone/1-
Allow /person
Allow /reverse-phone
Allow /reverse-address
Allow /email-search
Allow /background-checks
Allow /blog
Allow /about
Allow /white-pages
Allow /directory/
Allow /api/
Allow /_nuxt/
Allow /assets/
Allow /static/
Allow /cfcdn/
Allow /privacy
Allow /terms-of-service
Allow /checkout/pricing
Allow /business-pricing
Allow /wp-content/
Allow /wp-includes/
Allow /site-map
Allow /.well-known/assetlinks.json
Allow /favicon.ico$
Allow *.xml$
Allow *.xml.gz$
Allow /$
Allow /?
Disallow /
Disallow *?
Disallow /name/*/*/
Disallow /api/location/autocomplete-suggestions
Disallow /api/log-info
Disallow /api/person/ancestry
Disallow /address/*/residents$
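
Because the group mixes broad and narrow Allow and Disallow rules, the order in which they are listed does not decide the outcome by itself. The sketch below shows one way to evaluate a path against a subset of these rules, assuming the longest-match precedence described in RFC 9309: the matching rule with the longest pattern wins, and Allow wins a tie. The function names and the sample paths are hypothetical and chosen for illustration only; pattern length is used as a simple stand-in for "most specific match".

```python
import re

# A subset of the "*" group rules listed above, as (rule, path pattern).
RULES = [
    ("allow", "/name/*/*"),
    ("allow", "/name/*"),
    ("allow", "/address/*/*/"),
    ("allow", "/phone/1-"),
    ("allow", "/api/"),
    ("allow", "/$"),
    ("allow", "/?"),
    ("disallow", "/"),
    ("disallow", "*?"),
    ("disallow", "/name/*/*/"),
    ("disallow", "/api/log-info"),
    ("disallow", "/address/*/residents$"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    "*" matches any run of characters; a trailing "$" anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if the longest matching rule allows the path.

    Assumption: precedence follows RFC 9309 (longest pattern wins,
    Allow wins a tie, no matching rule means allowed).
    """
    best_len = -1
    best_allow = True
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt requires.
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            longer = len(pattern) > best_len
            tie_for_allow = len(pattern) == best_len and allow and not best_allow
            if longer or tie_for_allow:
                best_len, best_allow = len(pattern), allow
    return best_allow


# Hypothetical sample paths, not taken from the scan.
for sample in ["/", "/name/John-Smith", "/name/John-Smith/Seattle-WA/",
               "/api/log-info", "/search?q=x", "/?utm=1"]:
    print(sample, "->", "allowed" if is_allowed(sample) else "disallowed")
```

Under these assumptions, the home page and single-segment name pages are allowed, while deeper name paths ending in a slash, /api/log-info, and most query-string URLs are disallowed. Crawlers that apply first-match ordering instead would read the same file differently.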