Web Data Extractor
Type of Program: Utility - Link Extractor
Platforms: Win 95/98/2000/NT/ME/XP
Company Name: WebExtractor System
Installed Size: 510 KB
Have you ever sat there and looked at those sites that have link
pages that seem to have endless links and thought, "That must have taken a bit
of putting together"? Perhaps you want to put together a specific search engine
that will only have, say, recipe sites? It sounds an awful lot of work and time
to put together unless, that is, you have Web Data Extractor the answer to every
links page, search engine designer and webmasters prayers.
This program is a very powerful link extractor with attitude. Highly
configurable so you can extract exactly the right type of data you want for your
particular project and at the same time extremely easy to use particularly if
you view the "how to" section with clear examples in the excellent help file. It
will extract URLs, meta tags (title, description and keyword), email addresses,
phone and fax numbers from Web Sites, Search Engine Results, Web Groups or Dirs
and from a list of URLs.
Web Data Extractor provides high speed, multi-threaded, accurate data extraction
- and saves the data to disk file. The program has numerous filters to select
different options like - URL filter, date modified, file size, etc. It allows
user-selectable recursion levels, retrieval threads, timeout, proxy support and
many other options.
This is a typical example of what the program does with search engines:
"WDE will query 18+ popular search engines, extract all matching URLs from
search results, remove duplicate URLs and finally visits those websites to
extract data from there.
You can tell WDE how many search engines to use. Click "Engines" button and
uncheck the engines that you do not want to use. You can add other engine
sources as well.
WDE sends queries to search engines to get matching website URLs. Next it visits
those matching websites for data extraction. The depth it spiders in the
matching websites depends on "Depth" setting."
Data is saved in either CSV format or line by line text with the following
URL, Base, Domain, Title, Description, Keyword, Last Modified, Content Length
The emails are saved in a separate text file
This is a well designed and well authored piece of software that will be a
prayer answered for many a webmaster be they amateur or professional.
Ease of Installation:
Reviewed by Simon Baillie