##ACAP version=1.0 # Legacy robots.txt content. User-agent: Mediapartners-Google* Disallow: User-agent: * Disallow: /cgi-bin/ ###Disallow: /images/ Disallow: /img/ Disallow: /special/ # Un-comment the line below, if crawlers capable of understanding # ACAP records are to ignore conventional records # ACAP-ignore-conventional-records # ACAP local definitions # Resources found in three directories are crawlable # ACAP-resource-set: crawlable /public/ /promotion/ /news/ ACAP-resource-set: notcrawlable /cgi-bin/ /images/ /img/ /special/ # On this site .cache. means .preserve (store) until re-crawled. # ACAP-qualified-usage: cache preserve time-limit=until-recrawled # The same usages are permitted for all resources in the specified # resource set, so we can define a composite usage # ACAP-composite-usage: basic-usages crawl index present # Crawlers in general are prohibited to crawl this site ACAP-crawler: * ACAP-allow-crawl: / time-limit=until-recrawled ACAP-disallow-crawl: the-acap:resource-set:notcrawlable # All my usages are permitted for the specified resource set... # ACAP-allow-(basic-usages): the-acap:resource-set:crawlable # which is equivalent to permitting the three separate usages, which are # commented out here... # ACAP-allow-crawl: the-acap:resource-set:crawlable # ACAP-allow-index: the-acap:resource-set:crawlable # ACAP-allow-present: the-acap:resource-set:crawlable # ...but may only preserve copies in the locally-defined sense # ACAP-allow-(cache): the-acap:resource-set:crawlable