sdxcentral.com
robots.txt

Robots Exclusion Standard data for sdxcentral.com

Resource Scan

Scan Details

Site Domain sdxcentral.com
Base Domain sdxcentral.com
Scan Status Ok
Last Scan2024-10-01T12:59:58+00:00
Next Scan 2024-10-08T12:59:58+00:00

Last Scan

Scanned2024-10-01T12:59:58+00:00
URL https://sdxcentral.com/robots.txt
Redirect https://www.sdxcentral.com/robots.txt
Redirect Domain www.sdxcentral.com
Redirect Base sdxcentral.com
Domain IPs 172.66.41.17, 172.66.42.239, 2606:4700:3108::ac42:2911, 2606:4700:3108::ac42:2aef
Redirect IPs 172.66.41.17, 172.66.42.239, 2606:4700:3108::ac42:2911, 2606:4700:3108::ac42:2aef
Response IP 172.66.41.17
Found Yes
Hash 8125baefa9e1d205060e4c871bf7d94b03035cffe04a3af05c19eb1cae5374cf
SimHash cc1142c9c551

Groups

*

Rule Path
Disallow /

adidxbot
adsbot-google
amazonbot
apis-google
applebot
bingbot
bingpreview
duckduckbot
feedfetcher-google
feedly
facebookexternalhit
google-read-aloud
google-site-verification
googlebot
googleimageproxy
googleother
ia_archiver
kagibot
linkedinbot
mediapartners-google
qwantify
slack-imgproxy
slackbot
slurp
telegrambot
twitterbot
yahoo ad monitoring
yahoomailproxy
yeti

Rule Path
Disallow /?s*
Disallow /?p*
Disallow /search/*
Disallow /feed/*
Disallow /syndicated/*
Disallow *.doc$
Disallow *.docx$
Disallow *.ppt$
Disallow *.pptx$

googlebot-news

Rule Path
Disallow /sponsored/
Disallow /announcements/
Disallow /syndicated/

Other Records

Field Value
sitemap https://www.sdxcentral.com/sitemap_index.xml