However, the contents of a robots.txt are purely advisory. Whether the excluded parts of a website actually stay out of a search engine's index depends on the relevant web crawlers voluntarily adhering to the rules in robots.txt. In particular, a robots.txt cannot be used to protect the content of a website from access by unauthorized persons.
Example of a robots.txt:
# robots.txt for example.com

# I exclude these web crawlers
User-agent: Sidewinder
Disallow: /

User-agent: Microsoft.URL.Control
Disallow: /

# These directories/files should not
# be searched
User-agent: *
Disallow: /default.html
Disallow: /Temp/ # these contents will disappear soon
Disallow: /Privat/Familie/Geburtstage.html # Not secret, but should not be listed in search engines.
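A well-behaved crawler checks these rules before fetching a URL. As a minimal sketch, Python's standard-library urllib.robotparser can evaluate a simplified version of the rules above (the user-agent name "MyBot" and the URLs are hypothetical):

```python
from urllib.robotparser import RobotFileParser

# Simplified rules from the example above; normally the parser
# would fetch them from https://example.com/robots.txt via read().
rules = """\
User-agent: Sidewinder
Disallow: /

User-agent: *
Disallow: /Temp/
"""

parser = RobotFileParser()
parser.parse(rules.splitlines())

# A compliant crawler asks before fetching:
print(parser.can_fetch("Sidewinder", "https://example.com/index.html"))  # Sidewinder is blocked entirely
print(parser.can_fetch("MyBot", "https://example.com/Temp/file.html"))   # /Temp/ is disallowed for everyone
print(parser.can_fetch("MyBot", "https://example.com/about.html"))       # not disallowed
```

Note that this check is entirely voluntary on the crawler's side, which is exactly why robots.txt offers no access protection.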
Further reference:
https://de.wikipedia.org/wiki/Robots_Exclusion_Standard
The following video explains the benefits of a robots.txt in connection with the Google search engine.