MK Robot
- Version: 0.1
- Objective:
A socket-based implementation of a robot for use with search engines.
- Description:
A robot is a software package that helps implement the functionality of a search engine. A robot has three main functions. Given an HTML document as input, it:
- retrieves the document,
- indexes the document according to programmer-defined criteria,
- and stores it in a database of some type.
- More often than not, robots index an HTML document by its keywords and its description. Both are placed in META tags in the head of the document. There are, however, robots that use different criteria. For simplicity, version 0.1 of this robot uses only the document's keywords to index it.
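To make the keyword criterion concrete, here is a minimal sketch of pulling the keyword list out of a META tag. It is not the robot's actual code: the function name extract_keywords and the helpers are hypothetical, and it assumes the tag is written with double-quoted attributes, as in <meta name="keywords" content="a, b">, with keywords separated by commas.

```cpp
#include <algorithm>
#include <cassert>
#include <cctype>
#include <string>
#include <vector>

// Lowercase copy, used so that tag and attribute names match in any case.
static std::string to_lower(std::string s) {
    std::transform(s.begin(), s.end(), s.begin(), [](unsigned char c) {
        return static_cast<char>(std::tolower(c));
    });
    return s;
}

// Strip leading and trailing whitespace from one keyword.
static std::string trim(const std::string& s) {
    const char* ws = " \t\r\n";
    const std::size_t b = s.find_first_not_of(ws);
    if (b == std::string::npos) return "";
    const std::size_t e = s.find_last_not_of(ws);
    return s.substr(b, e - b + 1);
}

// Hypothetical helper: returns the comma-separated keywords found in the
// first <meta name="keywords" content="..."> tag, or an empty vector.
// Assumption: attributes are double-quoted and contain no '>' character.
std::vector<std::string> extract_keywords(const std::string& html) {
    std::vector<std::string> keywords;
    const std::string lower = to_lower(html);
    assert(lower.size() == html.size());  // offsets are shared by both strings

    std::size_t pos = 0;
    while ((pos = lower.find("<meta", pos)) != std::string::npos) {
        const std::size_t end = lower.find('>', pos);
        if (end == std::string::npos) break;
        const std::string tag = lower.substr(pos, end - pos);
        if (tag.find("name=\"keywords\"") != std::string::npos) {
            const std::string attr = "content=\"";
            const std::size_t c = tag.find(attr);
            if (c == std::string::npos) break;
            const std::size_t start = pos + c + attr.size();
            const std::size_t close = html.find('"', start);
            if (close == std::string::npos || close > end) break;
            // Take the value from the original text to preserve its case.
            const std::string content = html.substr(start, close - start);
            std::size_t i = 0;
            while (i <= content.size()) {
                std::size_t comma = content.find(',', i);
                if (comma == std::string::npos) comma = content.size();
                const std::string kw = trim(content.substr(i, comma - i));
                if (!kw.empty()) keywords.push_back(kw);
                i = comma + 1;
            }
            break;
        }
        pos = end + 1;
    }
    return keywords;
}
```

A real robot would likely need a proper HTML parser to handle single-quoted or unquoted attributes; the string search above is only meant to show where the index terms come from.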
- Implementation:
- The robot consists of two parts. This C++ program is the main part. It is essentially a client-side application implemented with sockets, which allow two remote endpoints (here a client and a server) to communicate over a standard protocol, in this case TCP/IP. The program establishes a connection with the remote server, sends a standard HTTP request, and waits for a response. Once the response is received, the data is stored in a temporary file and the connection to the server is closed. Control then passes to an Active Server Page (ASP), which processes the data received from the remote server and stores the result in another file.
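The connect, request, receive, and store sequence described above can be sketched as follows. This is an illustration under stated assumptions, not the robot's source: it uses the POSIX sockets API (the original may well have used Winsock, given that it is launched from ASP), and the function names build_get_request and fetch_to_file are hypothetical.

```cpp
#include <cassert>
#include <fstream>
#include <string>

#include <netdb.h>
#include <sys/socket.h>
#include <sys/types.h>
#include <unistd.h>

// Builds a minimal HTTP/1.0 GET request for the given host and path.
std::string build_get_request(const std::string& host, const std::string& path) {
    assert(!host.empty());
    const std::string target = path.empty() ? "/" : path;
    return "GET " + target + " HTTP/1.0\r\n"
           "Host: " + host + "\r\n"
           "Connection: close\r\n"
           "\r\n";
}

// Connects to host:port over TCP, sends the request, and writes the raw
// response (status line, headers, and body) to out_file.
// Returns true only if the server closed the connection cleanly.
bool fetch_to_file(const std::string& host, const std::string& port,
                   const std::string& path, const std::string& out_file) {
    addrinfo hints{};
    hints.ai_family = AF_UNSPEC;      // IPv4 or IPv6
    hints.ai_socktype = SOCK_STREAM;  // TCP
    addrinfo* results = nullptr;
    if (getaddrinfo(host.c_str(), port.c_str(), &hints, &results) != 0) return false;

    // Try each resolved address until one accepts the connection.
    int fd = -1;
    for (addrinfo* ai = results; ai != nullptr; ai = ai->ai_next) {
        fd = socket(ai->ai_family, ai->ai_socktype, ai->ai_protocol);
        if (fd < 0) continue;
        if (connect(fd, ai->ai_addr, ai->ai_addrlen) == 0) break;
        close(fd);
        fd = -1;
    }
    freeaddrinfo(results);
    if (fd < 0) return false;

    // send() may accept only part of the buffer, so loop until all is sent.
    const std::string request = build_get_request(host, path);
    std::size_t sent = 0;
    while (sent < request.size()) {
        const ssize_t n = send(fd, request.data() + sent, request.size() - sent, 0);
        if (n <= 0) { close(fd); return false; }
        sent += static_cast<std::size_t>(n);
    }

    std::ofstream out(out_file, std::ios::binary);
    if (!out) { close(fd); return false; }

    // Read until the server closes the connection (recv returns 0).
    char buf[4096];
    ssize_t n = 0;
    while ((n = recv(fd, buf, sizeof buf, 0)) > 0) {
        out.write(buf, n);
    }
    close(fd);
    return n == 0 && out.good();
}
```

A call such as fetch_to_file("example.com", "80", "/", "response.tmp") would leave the full response, headers included, in response.tmp for a later processing step. The host and file name here are placeholders.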
- The Active Server Page is the second part of the robot structure, and it is discussed in detail in the ASP section of this site. In short, its primary function is to start the C++ executable and then process its output.
- Click here to go directly to the search engine. Please follow the instructions for adding a site carefully.