Search notes:

Web crawling

TODO

https://www.kdsl.tu-darmstadt.de/de/kdsl/research-program/crawling-and-semantic-structuring/ tries to create a »combined, focused web crawling system that collects relevant documents from the web that is particularly suited for harvesting publications in the educational domain«
Heritrix

See also

web

Index