Access some class-notes and programs etc. on Computers in Chemistry 
Computational modeling for oligonucleotides obtainable from the PDB resource
Read the related longer essay Computational Chemistry: As I have Seen this Creature
Story of My Successful Struggle with Windows-Variety PC-GAMESS:
A real-life story

Proceed to download DblClkPCGAMESS, the tiny PC-GAMESS helper package developed by this author [keep gamess.exe, fastdiag.dll etc. files of the PC-GAMESS package in the folder c:\pcgamess, extract this tiny downloaded .zip file anywhere, then paste the resulting extracted files (3 or 4 files) to the folder c:\pcgamess - after this, to run PC-GAMESS (on the molecule specified in the input file Work2Do.inp), only a double-click need be made on the DblClk to Run PCGAMESS shortcut].
Some input examples for PC-GAMESS (Firefly) and their explanations


Computational Chemistry via Educational Tools:
Modelling Molecules and Reactions
(Delivered at Workshop on Computers in Chemistry, June 2006, Cotton College, Guwahati)

Rituraj Kalita

Computational chemistry attempts making models or simulations of the chemical species and chemical processes in the computer. As any other branch of science, it also use such models to interpret the naturally occurring chemical phenomena, and tries to predict yet undiscovered chemical phenomena as well.

Within this branch of science, let us discuss the ab initio quantum mechanical modelling of small and medium-sized molecular systems in some details, considering its current popularity. In this field, the mathematical (numerical) model of the molecule is always associated with a 3-D mobile visual model to be viewed by the user. Given the numerical model, the corresponding visual model may be obtained using a graphics software (e.g., PCModel, ArgusLab, ORTEP-3 etc.), while every visual model drawn or generated anyhow can be saved as (transformed into) the corresponding numerical model. The visual model helps in easy understanding of the molecular structure and stereochemistry; it attempts to represent the actual molecule as closely as possible in its shape and stereochemistry. There are generally provisions to view the models in different styles such as stick, ball and stick, electron dot surface etc. Besides, generally the visual model may be viewed from different angles (orientations) and in different enlargements. So, the 2-D structural figures drawn on paper (or in Windows Paint/ ISIS-Draw etc. drawing software) are thus rather poor visual models, compared to those made with such chemical modelling packages.

As a starting point, in this field we start with molecules lying in the vacuum (gas phase); it is possible to introduce corrections to this gas-phase model for the surrounding solvent medium etc. From quantum mechanics, we know that there is no question of the electrons in the molecule to be specified of their positions: we need to specify only the number of electrons in the molecule, while the electron probability density will be obtained from solution of the Electronic Schrödinger Equation (ESE). On the other hand, there’s the necessity of specifying the positions of all the nuclei (in addition to specifying the types and numbers of the nuclei) – as from the Born-Oppenheimer approximation we know that to construct the ESE the nuclear framework (arrangement) must be specified. Besides, with the same set of nuclei and the same number of electrons different isomers may arise, if the relative nuclear positions are allowed to vary. Thus, the molecular model must include the nuclear coordinates, in addition to the types and numbers of nuclei and the total number of electrons. Using such a molecular model, the molecular electronic wavefunction and the electron probability density can be directly found (it’s just a matter of time) by solving the ESE, thereby arriving at a complete description of the molecule.

However, the total number of electrons may not be explicitly mentioned in the molecular model (say, in the .xyz format used below), in which case it is understood that the molecule is electro-neutral i.e., there are just sufficient number of electrons (say 26 in ethanol) to keep the molecule uncharged. In other cases (say, for GAMESS) the charge of the molecular system needs to be specified, meaning that the number of electrons thus gets understood. Coming to the question of nuclear-framework specification, we see that the nuclear-framework part of the molecular model is specified in mainly two different formats. In one format, the type (say atomic symbol, such as H or N) of each of the nuclei along with its Cartesian (x, y, z) coordinates (separated by space) are specified one by one for all the nuclei. Generally, the molecular model is in the form of a text file, with one line each dedicated to the description of each of the nuclear type & position. The unit of the coordinates is, practically universally, Angstrom (not atomic unit). Thus, the nuclear framework specification for a water molecule may be as follows:

O          0.000000           0.127174          0.000000
H          0.758132          -0.508697          0.000000
H         -0.758132          -0.508696          0.000000

In the other format called the z-matrix specification, the position of the first nucleus is kept unspecified. The position of the 2nd is expressed in terms of the distance from the 1st. The position of the 3rd is expressed in terms of the distance from the 2nd or the 1st, and in terms of the bond angle amidst the 3rd, the 2nd/ 1st, and the 1st/ 2nd nucleus. All the rest of the nuclear positions require specification of one bond distance, one bond angle and one dihedral angle (i.e., angle between two planes, say between the 1-2-3 and the 2-3-4 nuclei-connecting planes). The justification of such a (initially incomplete-looking) z-matrix specification is obvious: it is because translating the whole nuclear framework to another position or rotating it doesn't lead to a different molecule! It is the Cartesian coordinate specification format where there is rather too much of coordinates specification (over-specified by six degrees of freedom), but there also it is understood that translation or rotation of the whole framework creates no different molecule. This second format of nuclear framework specification also has each line describing one nucleus, with unit of distance and angle being Angstrom and degrees by convention, as exemplified for H2O:

O
H         1          0.989493
H         1          0.989492           2          100.024728

In this branch of science, one naturally starts with the preparation of the molecular model. Using model-drawing software packages such as ArgusLab (a free software) or PCModel etc., such models may either be drawn from scratch OR be modified from pre-existing models such as of aliphatic/ aromatic rings, amino-acids, mono-saccharides, nucleic-acid bases etc. Such drawing or modification generally involves quite user-friendly steps, as may be exemplified in case of ArgusLab. In ArgusLab, to add a bond, one needs to left-click at the starting nucleus, and then right-click at the new nuclear position (after making sure that its Auto Bonds toolbar-button is set to ON). To delete an existing nucleus, one needs to right-click at the nucleus, then left-click at the menu-item Delete Atom. A new nucleus drawn is assumed as a sp3 hybridized C-atom, which assignment may be altered from a periodic-table atom-list. The H-atoms needn't be drawn; they're just understood and may be shown/ hidden at will. Gross structural mistake(s) made in drawing or modification (e.g., creation of absurd bond-lengths etc.) may now be corrected by invoking an inbuilt, raw energy-minimization procedure called Clean Geometry. After drawing or modification of the visual model in this way, the mathematical model may be immediately obtained in different formats such as XYZ (one of the former class, as above) or Brookhaven PDB (Protein Data Bank, a format incorporating some more information) etc.

The mathematical models thus formed are then fed into computational software packages such as PC-GAMESS or Gaussian etc., the desired level of computational theory (say, Huckel/ Hartree-Fock/ Moller-Plasset 2nd-Order etc.) is specified or kept understood, and the lengthy computational process is allowed to go on. Through such extensive computations, modern computational packages such as PC-GAMESS or Gaussian can predict many properties and reactions of molecules such as: molecular energy & structures, transition-state energy & structures, vibrational frequencies, reaction energies, electron density distribution, potential energy surface (PES) & reaction pathways etc. The greatly popular free package PC-GAMESS (please note that if you are working in, say, a Pentium-IV single-processor single-PC system under Windows, be sure to download only the Windows, Sequential, Optimized for P-IV form of PC-GAMESS), is fundamentally a DOS-fashioned one that may run in the command prompt (within its folder) via a DOS-command of the sort gamess -i Work2do.inp -o Output.txt where Work2do.inp is a characteristic PC-GAMESS input file (with contents similar to those shown below) and Output.txt is its program output file. However, user-friendly graphical user interfaces (GUIs) has been developed that includes the celebrated RunPCG available from the PC-GAMESS website itself. This author, however, prefers working with his self-developed tiny (5-kB) package (incorporating a batch file DClkPCGM.bat, the working link file DblClk to run pcgamess, and a well-illustrated Work2do.inp); this tiny package may be free-downloaded from the top of this web-page of this essay (you might as well like to read the story of its development).

It may be noted here that such a computational package may perform computations in two different class of ways: (i) the nuclear-framework may be considered as exactly fixed and so no modification is attempted into it (called a single-point calculation) OR (ii) the nuclear-framework is considered to be modifiable, and so the optimum molecular structure is sought for through its modification (called a structure-optimization calculation).  Through the second way we may easily look for theoretical prediction of chemical reactions, because we know that it is the change in the relative nuclear coordinates that imply a chemical reaction (e.g., the reaction H2 (g) + I2 (g) = 2HI (g) implies that the H–H  & I–I distances have increased much and the H–I distances have decreased greatly).

How exactly is the reaction between two or three molecules modelled? To proceed with such modelling, at first the model of a relevant supermolecule, which includes all the reacting molecules, has to be constructed by joining the individual molecular models. For example in case of the H2-I2 reaction, the supermolecule must include the combination of an H2 molecule and an I2 molecule. For exact considerations unlike the simple-minded structure-optimization example mentioned above, a PES (nuclear potential energy U as a function of relative nuclear coordinates) calculation has to be planned for and performed, then the reaction pathways to be predicted, leading to prediction of even the rate constants.  

Displayed below are the contents of a simple PC-GAMESS input file for a (single) formaldehyde molecule. Note the atomic numbers (6.0, 8.0, 1.0 etc.) specified in the 2nd column that distinguishes the Cartesian (nuclear) coordinates in PC-GAMESS input-file from those in the .xyz file (generated by ArgusLab). To form a PC-GAMESS input file for another molecule, you may, at this beginner's stage, manually create this 2nd column.

 $CONTRL SCFTYP=RHF RUNTYP=OPTIMIZE COORD=CART 
 MULT=1 ICHARG=0 $END
 $SYSTEM TIMLIM=20000 MEMORY=10000000 $END
 $STATPT NSTEP=1000 $END
 $BASIS GBASIS=STO NGAUSS=3 $END
 $GUESS GUESS=HUCKEL $END
 $DATA
Test...HCHO molecule - RHF/STO-3G (a comment line)
Cn 1

 C    6.0     0.6084782705     -0.0000011694    0.00000000000
 O    8.0    -0.6082418894     -0.0000002093    0.00000000000
 H    1.0     1.2040919862     -0.9264398115    0.00000000000
 H    1.0     1.2040973125      0.9264340484    0.00000000000
 $END

What are the implications and advised-values for the mysterious keywords above? How to implement in PC-GAMESS theories beyond this simple RHF level and basis-sets (set of AO-equivalents) beyond this simple STO-3G set? Click here for the answers.


Related download-sites and references:

planaria-software.com and arguslab.com (for the modelling package ArgusLab)
classic.chem.msu.su/gran/gamess (for the computational package PC-GAMESS, presently termed Firefly)