SOLAR

Intelligent Systems Research Group

 

Motivation
People

Description
Publications

Links
Contact

 

 

 

 

  Primary Visual Cortex from Gray Matters: The Brain Movie

http://www.psc.edu/biomed/brainmovie/


 


Motivation:

A data driven Self Organizing Learning Array (SOLAR) is proposed and its hardware structures are being developed in this project.  It has three advantageous features over a typical neural network technology: online learning; local interconnections; entropy based local selection of the optimum transformation functions and threshold values. This learning array can perform intelligent tasks such as pattern recognition, prediction and modeling of unknown systems without being programmed beforehand, and find a wide range of applications from security to everyday life, from military to commercial.

The images are from the web page http://occipita.cfa.cmu.edu/brain/public/

 
TOP

 

People

Janusz Starzyk: Prof. of EECS, Ohio University.

Annie Shen: Asst. Prof. of Mathematics, Ohio University.

Jim Zhou: Prof. of EECS, Ohio University.

James Graham: MS Student 2004-

Lily Li: MS Student 2004-

Yinyin Liu: PhD Student, 2003-

Haibo He: Ph.D. Student, 2002-
Zhen Zhu: Ph.D. Student, 2002-

 

Former members:

Mingwei Ding: Ph.D. Student, 2000-2005
Yongtao Guo: Ph.D. Student, 1998-2004
Tsun-ho Liu: MS Student, 2000-2002
Jing Liang: Ph.D. Student, 2000-2003
Jian Feng: Ph.D. Student, 2001-2002
Feng Wang: MS Student, 2002-2004
Zhineg Zhou: Ph.D. Student, 2002-2004

TOP


Description

The basic unit of the learning array is called a processing component, which is randomly pre-wired with other local processing components to form this learning architecture. Each processing component has the capability to select input data source from the wired connections and perform different transformation functions on the selected data. In this architecture, every processing component will learn concurrently and choose its own optimum transformation function to reach a maximum value of information index based on the entropy calculation. It will select the threshold value for the output signals. The output of one processing component is sent to its neighbors. General organization of this array is illustrated in figure 1.

Fig.1 Architecture view of SOLAR

Although the existing applications of a SOLAR machine are limited to task typical to artificial neural networks our objective it to build a machine which can exhibit an intelligent behavior and make associations between different events observed by machine.  SOLAR is different from organization of neural networks in several important aspects:


source:http://www.pbs.org/wnet/brain/
In order to accumulate learning results from different subspaces, we need to consider what is the amount of added learning and weight it against increased system complexity and resulting errors of statistical learning.  This is the case when a set of training data is obtained from a small subspace of the original space and therefore it is related to less reliable statistics about the training data. Let us define the subspace s information deficiency (normalized relative subspace entropy) .

Information deficiency indicates how much knowledge must be gained to resolve the classification problem in the given subspace.  Initially, when a neuron receives its data from the NN primary inputs (raw signal data) we assume that the information index is zero, which means that the input information deficiency for the first layer of neurons is equal to 1.

Therefore, each subspace can be separately learned by minimizing its information deficiency.  If a new subspace is subdivided, a new information index Is in this subspace is obtained.  In order to select which feature gives the largest local improvement in overall information we must maximize the information deficiency reduction (IDR).

Several neurons can be mapped  into a single Virtex chip.  By connecting a large number of such chips we can grow the complexity and computing power of the resulting SOLAR system.  Conceptual illustration of such mapping is shown on Fig. 2, where several neurons are mapped to an FPGA board.

                
Fig.2 Mapping neurons to Virtex

Wiring

In SOLAR architecture, initial connections are pseudo-randomly defined. Figure 3 shows an example of the initial pseudo-random connections among neuron units in SOLAR that corresponds to a software generated configuration sequence.

Fig. 3 Pseudo-random connections

System Hardware

The SOLAR system is being designed as a 3D block of processing units using 386 XCV1000 BG560 chips donated to us from Xilinx (http://www.xilinx.com/).  Chips are organized into boards 6 chips per board and boards are stacked together and connected in 3D system.

A single PCB board contains 4 interconnected VIRTEX XCV1000 FPGA chips. All of the chips have identical architecture.  They are interconnected in parallel via several hundred generic I/O pins provided by VIRTEX XCV1000 FPGA chip.  The layout of the prototyping PCB for SOLAR is shown on Fig. 4a)

Fig. 4a) PCB layout.

Fig. 4b) shows an assembly of three PCB hosting 12 XCV1000 chips.  This configuration is expandable in 3 dimensions, for a significant growth of the computational power of the entire SOLAR system.

 

Fig. 4b) PCB assembly.

The initial configuration information is transferred into every single chip one by one via JTAG port.  Then, the layout-identical PCB boards are expanded to a rack architecture including 4 PCBs with twenty-four VIRTEX XCV1000 chips as shown in the left part of Fig. 5. These PCB boards are interconnected vertically and configured in a broadcast fashion – the identical configuration bits are sent to every PCB board, while the FPGA chips are programmed via JTAG port in a daisy chain way.

The main PCB boards are interfaced to a PC through the interface boards Fig. 4c).  Each interface board will have two Virtex chips  and the data and configuration busses.  The Virtex chips on the interface board are connected to these buses for the reception of input data and the chip’s configuration information.  The buses are 12-bit wide and are connected to the SCSI bus through switches.  The buffers on the two buses guarantee the signal integrity providing the signal propagation to the next board. There are three connectors at each board, providing the interface board with expandability in 2D.

Fig. 4c) PCB assembly.

Finally, 3D SOLAR system as shown in the right part of Fig. 5, containing 4x4 SOLAR racks with close to 400 million gates, will be built to implement a network.

 System can be expanded in any direction as needed.  A conceptual organization of this system is as illustrated:

 
  Fig. 5 3D cube SOLAR architecture.

Neurons Operation

Neurons receive input data from its neighbors or primary inputs as well as thresholded clock control signal TCI which indicates whether the data is from the subspace in which this particular neuron performs classification task.  In addition an input information deficiency if provided for the neurons subspace.  On the system clock the neuron processes its inputs according to selected transformation function and compares result with a learned threshold.  Neuron's output contains transformed input data (usually fed to other neurons) as well as thresholded clock information for its subspaces (TCOT and TCOTI).  These are thresholded clock signals which indicate that input data from neuron's subspace either satisfied or did not satisfy neuron's threshold.  Information deficiencies for neuron's subspaces is also provided.  Neuron also copy TCI and input information deficiency to TCO and output information deficiency for data which were not classified by this neuron.  In this case neuron only transforms the input data without separation.

Fig. 6 Illustration of neurons concurrent operation.

Below is the illustration of subspace learning by a single neuron obtained in a synthetic  2D  5-class data set classification problem in SOLAR structure.  Final classification is based on combination of separating functions from many neurons in the array.


Fig.7a) Simulation result for data classification

A detail of two neurons separating a single class from this set is shown on Fig. 7b)

Fig.7b)  Two Neurons Separating Class 2 (Zoom In).

A network of such neurons can be trained to recognize patterns, store images, and make associations.  Activity af neurons is triggered based on associations between neuron firing during the training stage as shown on Fig. 8.

Fig.      Neuronal outputs and dendrite trees of the training input neurons.

 

For a reference to real life classification problems two benchmark data sets were analyzed by the SOLAR structures.  They are credit card and adult income data sets from Machine Learning Repository, available at FTP: Host name: ftp.ics.uci.edu directory: /pub/machine-learning-databases/
In both cases SOLAR demonstrated its ability to organize its hardware and to classify learned data sets with good performance.

Credit Card Data Set

The credit card approval data in Australia (Credit Screening Database) was acquired from the University of California at Irvine (ICS, UCI, 1995, December).  The data set has 690 instances, 16 features including class attribute, and it is divided into 2 classes, which are approve and reject.  The data set contains numbers of credit card applications, which feature names and values have been replaced by symbols to protect the individuals’ privacy of the data.  Fig. 9 shows the SOLAR structure evolved after learning and to the right are classification results with reference to cited in literature other classification algorithms. (Michie, D., Spiegelhalter, D. J., and Taylor, C. C. (1994), Machine Learning, Neural and Statistical Classification, Ellis Horwood Limited, London, U.K.).  Note that SOLAR lost only to Cal5 algorithm defeating all the neural network algorithm on this list (highlighted in yellow color).


Fig. 9  SOLAR structure for Credit Card problem
                                Classification results


Algorithm
Error Rate
Cal5
0.131
SOLAR
0.1333
Itrule
0.137
Discrim
0.141
Logdisc
0.141
DIPOL92
0.141
CART
0.145
RBF
0.145
CASTLE
0.148
NaiveBay
0.151
IndCART
0.152
Backprop
0.154
C4.5
0.155
SMART
0.158
Baytree
0.171
k-NN
0.181
NewID
0.181
AC2
0.181
LVQ
0.197
ALLOC80
0.201
CN2
0.204
Quadisc
0.207
Default
0.440
Kohonen
Failed

Adult Income

This adult income data set (Adult Database) was obtained from the University of California at Irvine (ICS, UCI, 1995, December).  The data set contains two sets of data.  One is training data, which has 32561 instances while another one is testing data, which has 16281 instances.  Both have 15 features including class attribute, and they are also divided into 2 classes.  The data set contains both symbolic values such as gender, race, etc., and missing data.  Fig. 10 shows the SOLAR structure evolved after learning and to the right are classification results with reference to cited in literature other classification algorithms.  As it appears from larger complexity of the evolved neurons structure and higher classification errors, this problem seems to be more complex than the credit card application problem.  Note that that there is not a single neural network algorithm on this list.
 

Fig. 10  SOLAR structure for Adult Income problem
Classification results


Algorithm
Error Rate
FSS Naïve Bayes
0.1405
NBTree
0.1410
C4.5-auto
0.1446
IDTM (Decision table)
0.1446
HOODG / SOLAR
0.1482
C4.5 rules
0.1494
OC1
0.1504
C4.5
0.1554
Voted ID3 (0.6)
0.1564
CN2
0.1600
Naïve-Bayes
0.1612
Voted ID3 (0.8)
0.1647
T2
0.1687
1R
0.1954
Nearest-neighbor (3)
0.2035
Nearest-neighbor (1)
0.2142
Pebls
Crashed

TOP



 

Publications (see also)

  • Software Simulation of a Self-organizing Learning Array System
  • Future Hardware Realization of Self-Organizing Learning Array and Its Software Simulation


    The images are from the web page http://occipita.cfa.cmu.edu/brain/public/

    This colored scanning electron micrograph shows the synapses, or connections, between two nerve fibers (in purple) and a nerve cell (yellow). The picture is magnified 10,000 times (source: http://www.pbs.org/wgbh/nova/mind/electric.html)

    New Version

    Contact

    This page is maintained by: Mingwei Ding.  If you have question about this webpage please email  me here
    TOP