theserverside.com
robots.txt

Robots Exclusion Standard data for theserverside.com

Resource Scan

Scan Details

Site Domain theserverside.com
Base Domain theserverside.com
Scan Status Ok
Last Scan 2024-09-20T11:10:27+00:00
Next Scan 2024-09-27T11:10:27+00:00

Last Scan

Scanned 2024-09-20T11:10:27+00:00
URL https://theserverside.com/robots.txt
Redirect https://www.theserverside.com/robots.txt
Redirect Domain www.theserverside.com
Redirect Base theserverside.com
Domain IPs 104.18.24.109, 104.18.25.109, 2606:4700::6812:186d, 2606:4700::6812:196d
Redirect IPs 104.18.24.109, 104.18.25.109, 2606:4700::6812:186d, 2606:4700::6812:196d
Response IP 104.18.25.109
Found Yes
Hash 354cca4341fd5a5a0a2acf0c639f51a154227e139a0c59e9216cc009505cb779
SimHash 4c54d5c247b8
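
The Hash value above is 64 hexadecimal characters, which is the length of a SHA-256 digest. The report does not name the algorithm, so treating it as SHA-256 is an assumption. Under that assumption, a minimal sketch of how a scanner might fingerprint a fetched robots.txt body and detect changes between scans could look like this (the function names `content_fingerprint` and `has_changed` are hypothetical):

```python
import hashlib


def content_fingerprint(body: bytes) -> str:
    """Return a hex SHA-256 digest of the raw robots.txt bytes.

    Assumption: the report's Hash field is SHA-256; the source does not say.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hash: str, body: bytes) -> bool:
    """Compare a stored digest against the digest of a newly fetched body."""
    return content_fingerprint(body) != previous_hash
```

An exact digest changes on any byte difference, which is presumably why the report also carries a SimHash, a similarity-preserving fingerprint that this sketch does not attempt to reproduce.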

Groups

*

Rule Path
Disallow /search/query?q=*
Disallow /search/query

mediapartners-google

Rule Path
Allow /search/
Allow /googleNews.xml

updownerbot

Rule Path
Disallow /

bingbot

Rule Path
Disallow /search/query?q=*
Disallow /search/query
Disallow /*?s=*

Other Records

Field Value
crawl-delay 3

msnbot

Rule Path
Disallow /search/query?q=*
Disallow /search/query
Disallow /*?s=

Other Records

Field Value
crawl-delay 3

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 3

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 3

Other Records

Field Value
sitemap https://www.theserverside.com/siteIndex.xml
sitemap https://www.theserverside.com/googleNews.xml
sitemap https://www.theserverside.com/siteIndexVideo.xml
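
The groups and records listed above can be checked programmatically. The following is a sketch only: `ROBOTS_TXT` is a hypothetical reconstruction from the rules shown in this report, not the actual file (which also contains 2 invalid lines and repeated bingbot/msnbot groups that are omitted here), and `examplebot` and the helper `allowed` are made-up names. It uses Python's standard `urllib.robotparser`, which matches rules by simple path prefix and does not interpret `*` wildcards inside paths, so a rule such as `/*?s=*` will not behave as it would in a wildcard-aware crawler.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction from the groups listed in the report;
# the real robots.txt may differ in ordering and content.
ROBOTS_TXT = """\
User-agent: *
Disallow: /search/query?q=*
Disallow: /search/query

User-agent: mediapartners-google
Allow: /search/
Allow: /googleNews.xml

User-agent: updownerbot
Disallow: /

User-agent: bingbot
Disallow: /search/query?q=*
Disallow: /search/query
Disallow: /*?s=*
Crawl-delay: 3

Sitemap: https://www.theserverside.com/siteIndex.xml
"""

BASE = "https://www.theserverside.com"

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Return True if the parsed rules let `agent` fetch BASE + path."""
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    for agent in ("examplebot", "mediapartners-google", "updownerbot", "bingbot"):
        print(agent, allowed(agent, "/search/query?q=java"), parser.crawl_delay(agent))
```

An agent with no group of its own, such as the hypothetical `examplebot`, falls back to the `*` group, while `updownerbot` is blocked from every path and `bingbot` additionally gets the crawl-delay of 3 from its group.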

Warnings

  • 2 invalid lines.