gdgdocs.org
robots.txt

Robots Exclusion Standard data for gdgdocs.org

Resource Scan

Scan Details

Site Domain gdgdocs.org
Base Domain gdgdocs.org
Scan Status Ok
Last Scan2024-11-06T18:28:21+00:00
Next Scan 2024-11-20T18:28:21+00:00

Last Scan

Scanned2024-11-06T18:28:21+00:00
URL https://www.gdgdocs.org/robots.txt
Domain IPs 104.21.44.100, 172.67.198.136, 2606:4700:3031::ac43:c688, 2606:4700:3033::6815:2c64
Response IP 172.67.198.136
Found Yes
Hash 63031de6a60170075f6d909f7934aa2211fac6f67159c0dec5862c910402f871
SimHash 4e14423ba719

Groups

*

Rule Path
Allow /$
Allow /?hl=
Disallow /?hl=*&
Allow /support/
Allow /a/
Allow /Doc
Allow /View
Allow /ViewDoc
Allow /present
Allow /Present
Allow /TeamPresent
Allow /EmbedSlideshow
Allow /presentation
Allow /templates
Allow /previewtemplate
Allow /fileview
Allow /gview
Allow /viewer
Allow /leaf
Allow /file
Allow /open
Allow /document
Allow /drawings
Allow /demo
Allow /folder
Allow /start
Allow /spreadsheet
Allow /forms
Allow /macros
Allow /keep
Allow /static
Allow /drive/
Disallow /templateabuse
Disallow /

Other Records

Field Value
crawl-delay 1