CIS006-2: Concepts and Technologies of Artificial Intelligence Assignment Sample 2022

Introduction

Our primary task in this assignment is to use strategies such as random search and meta-learning to optimize the structure and parameters of Artificial Neural Networks (ANNs) on the given problem. Optimization is essential to maximize the recognition accuracy of ANNs designed to solve biometric tasks. Among the many optimization algorithms available, random search is computationally cheap and simple to apply. Meta-learning algorithms add a further layer of learning on top of other learners, which makes them more involved, but they can address complex optimization problems.

Designing a Solution

Here we use Google Colab to perform the optimization task. We start by loading the data into Google Colab as a NumPy array of the correct shape (m, nh, nw, c), which we read as the number of images, image height, image width, and number of channels.
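As a minimal sketch of the expected array layout, the snippet below stacks a few images into a single (m, nh, nw, c) array. The helper name stack_images and the random stand-in images are assumptions for illustration, not part of the assignment code.

```python
import numpy as np

def stack_images(images):
    """Stack a list of equally sized images into one (m, nh, nw, c) array."""
    batch = np.stack(images, axis=0)
    if batch.ndim == 3:
        # Grayscale images without a channel axis: add c = 1 at the end.
        batch = batch[..., np.newaxis]
    return batch

# Hypothetical stand-in data: 5 grayscale images of 8 x 6 pixels.
rng = np.random.default_rng(0)
images = [rng.integers(0, 256, size=(8, 6), dtype=np.uint8) for _ in range(5)]
X = stack_images(images)
print(X.shape)  # (5, 8, 6, 1)
```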

The following guidelines should be followed when optimizing neural networks:

Start with the learning rate;

Next, try the number of hidden units, the mini-batch size, and the momentum term;

Then tune the number of layers and the learning rate decay.

These are good tips. Intuition is needed to build a customizable NN in Python. In Google Colab, almost all the required libraries are already installed. Because we have to train the NN, it is essential to use a GPU to speed up training (Liu, 2019). The overall procedure for optimization is as follows:

The learning rate is an important hyperparameter to tune, because it can give large improvements in performance without affecting the training time. Smaller batch sizes often give better results, though training takes longer. Likewise, training for more epochs usually improves accuracy, but at a higher cost in time and compute.

The optimizer is also an essential hyperparameter to tune. Deeper and wider neural networks do not always help. Standardizing the features can improve the performance of the model, and it is easier than tuning the parameters.
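Feature standardization can be sketched as follows. This is an illustrative example on synthetic data, assuming the statistics are computed on the training set only and reused for the test set; the function name standardize and the eps guard are assumptions.

```python
import numpy as np

def standardize(X_train, X_test, eps=1e-8):
    """Scale features to zero mean and unit variance using training statistics only."""
    mean = X_train.mean(axis=0)
    std = X_train.std(axis=0)
    # eps guards against division by zero for constant features.
    X_train_s = (X_train - mean) / (std + eps)
    X_test_s = (X_test - mean) / (std + eps)
    return X_train_s, X_test_s

# Hypothetical stand-in data: 4 features with mean 50 and standard deviation 10.
rng = np.random.default_rng(0)
X_train = rng.normal(loc=50.0, scale=10.0, size=(200, 4))
X_test = rng.normal(loc=50.0, scale=10.0, size=(50, 4))
X_train_s, X_test_s = standardize(X_train, X_test)
print(X_train_s.mean(axis=0).round(6), X_train_s.std(axis=0).round(6))
```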

Neural networks are powerful, but they are not suitable for everything. Training and tuning an NN can take up to a thousand times longer than for non-neural models. They are considered the best fit for use cases such as computer vision (CV) and natural language processing (NLP).

Data set

The dataset used for this optimization task is the same as for assignment 1. It consists of 5236 features, so training and tuning the parameters to reach a high-accuracy model would take a long time. We therefore reduce the number of features while preserving the variance of the data by using Principal Component Analysis (PCA). About 99% of the variance of the dataset is retained, even though the features are reduced to 190. This process is known as pre-processing.

Pre-processing is done in several stages: converting each image into a one-dimensional array so the NN can process it; scaling the images to make the data easier to handle; dividing the data into training and test sets, so that the model can be evaluated on the test set while it is trained on the training set; and finally applying PCA, a dimensionality reduction method that reduces the number of features of the dataset without significantly affecting its quality.
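The stages above can be sketched as follows. This is a minimal NumPy-only illustration on synthetic stand-in images, not the assignment's actual pipeline: PCA is computed with an SVD instead of a library class, and the helper names fit_pca and apply_pca, the 80/20 split, and the image size are all assumptions.

```python
import numpy as np

def fit_pca(X_train, variance_to_keep=0.99):
    """Fit PCA on the training data, keeping enough components for the target variance."""
    mean = X_train.mean(axis=0)
    # SVD of the centered data; the rows of vt are the principal directions.
    _, s, vt = np.linalg.svd(X_train - mean, full_matrices=False)
    explained = (s ** 2) / np.sum(s ** 2)
    # Smallest k whose cumulative explained variance reaches the target.
    k = int(np.searchsorted(np.cumsum(explained), variance_to_keep) + 1)
    k = min(k, len(s))
    return mean, vt[:k]

def apply_pca(X, mean, components):
    """Project data onto the retained principal components."""
    return (X - mean) @ components.T

# Hypothetical stand-in data: 120 images of 16 x 16 pixels driven by 5 hidden factors.
rng = np.random.default_rng(0)
latent = rng.normal(size=(120, 5))
images = latent @ rng.normal(size=(5, 256)) + 0.01 * rng.normal(size=(120, 256))
images = images.reshape(120, 16, 16)

X = images.reshape(len(images), -1)        # 1) flatten each image to a 1-D vector
X = (X - X.min()) / (X.max() - X.min())    # 2) scale the values into [0, 1]
idx = rng.permutation(len(X))              # 3) shuffle, then split 80/20
split = int(0.8 * len(X))
X_train, X_test = X[idx[:split]], X[idx[split:]]
mean, components = fit_pca(X_train)        # 4) PCA fitted on the training set only
Z_train = apply_pca(X_train, mean, components)
Z_test = apply_pca(X_test, mean, components)
print(X_train.shape, "->", Z_train.shape)
```

Fitting PCA on the training set only and reusing the same projection for the test set keeps the test data out of the fitting step.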

Random Search Optimization

Random search is also called random optimization or random sampling. It involves generating random inputs to the objective function and evaluating them. It is effective because it assumes nothing about the structure of the objective function. This is beneficial for problems where extensive domain knowledge might otherwise bias the optimization strategy, and it allows non-intuitive solutions to be found.

Random search is considered an excellent strategy for more complicated problems with discontinuous or noisy regions of the search space, which can cause trouble for algorithms that depend on reliable gradients (Brownlee, 2021). Random samples can be generated from the domain using a pseudorandom number generator. Every variable needs a well-defined range; a uniformly random value is sampled from that range and then evaluated.

Generating the samples is trivial and takes little memory, so it can be efficient to generate a large number of input samples and then evaluate them. Every sample is independent, so the samples can be evaluated in parallel if needed to speed up the process.
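A minimal sketch of random search under these assumptions is shown below. The quadratic objective function, the bounds, and the sample count are made up for illustration; in the assignment the objective would be the validation score of a trained network.

```python
import random

def objective(x, y):
    """Toy objective to minimize; its minimum is 0 at (x, y) = (1, -2)."""
    return (x - 1.0) ** 2 + (y + 2.0) ** 2

def random_search(objective, bounds, n_samples, seed=0):
    """Sample each variable uniformly from its range and keep the best point found."""
    rng = random.Random(seed)  # pseudorandom number generator
    best_point, best_score = None, float("inf")
    for _ in range(n_samples):
        point = [rng.uniform(low, high) for low, high in bounds]
        score = objective(*point)  # each sample is evaluated independently
        if score < best_score:
            best_point, best_score = point, score
    return best_point, best_score

bounds = [(-5.0, 5.0), (-5.0, 5.0)]  # one well-defined range per variable
best_point, best_score = random_search(objective, bounds, n_samples=2000)
print(best_point, best_score)
```

Because no sample depends on another, the loop body could be distributed across workers without changing the result.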

Meta Learning

Meta-learning in machine learning refers to learning algorithms that learn from other learning algorithms. Most commonly, it means using ML algorithms that learn how best to combine the predictions of other ML algorithms, as in the field of ensemble learning.

Meta-learning can also refer to the manual model selection and tuning performed by experts on ML projects, which advanced ML methods seek to automate. It also refers to learning across multiple related predictive modeling tasks, which is otherwise known as multi-task learning (Brownlee, What is Meta-learning in ML, 2020). Meta-learning algorithms learn from the results of other ML algorithms, which in turn learn from the data.

This means that meta-learning requires other learning algorithms that have already been trained on the data. It takes the outputs of existing ML algorithms as its input and predicts the class labels. Meta-learning also refers to multi-task learning problems. AutoML is not usually referred to as meta-learning, although AutoML algorithms may harness meta-learning across learning tasks, which is known as learning to learn.
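To illustrate how a meta-learner can take the outputs of already trained models as its input, here is a small NumPy-only sketch of a holdout variant of stacking. Everything in it is an assumption for illustration: the two base learners, the synthetic two-class data, and the split sizes are not taken from the assignment.

```python
import numpy as np

class CentroidScorer:
    """Base learner: distance to the class-0 centroid minus distance to the class-1 centroid."""
    def fit(self, X, y):
        self.c0 = X[y == 0].mean(axis=0)
        self.c1 = X[y == 1].mean(axis=0)
        return self

    def score(self, X):
        return np.linalg.norm(X - self.c0, axis=1) - np.linalg.norm(X - self.c1, axis=1)

class LogisticModel:
    """Logistic regression trained by plain gradient descent."""
    def fit(self, X, y, lr=0.1, epochs=500):
        Xb = np.hstack([X, np.ones((len(X), 1))])  # append a bias column
        self.w = np.zeros(Xb.shape[1])
        for _ in range(epochs):
            p = 1.0 / (1.0 + np.exp(-Xb @ self.w))
            self.w -= lr * Xb.T @ (p - y) / len(y)
        return self

    def score(self, X):
        return np.hstack([X, np.ones((len(X), 1))]) @ self.w

    def predict(self, X):
        return (self.score(X) > 0).astype(int)

# Hypothetical stand-in data: two overlapping Gaussian classes with 4 features.
rng = np.random.default_rng(0)
n = 600
y = rng.integers(0, 2, size=n)
X = rng.normal(size=(n, 4)) + 1.5 * y[:, None]

# Separate subsets for the base learners, the meta-learner, and the final test.
base_idx, meta_idx, test_idx = np.split(rng.permutation(n), [300, 450])
base_models = [CentroidScorer().fit(X[base_idx], y[base_idx]),
               LogisticModel().fit(X[base_idx], y[base_idx])]

def base_outputs(X):
    """Outputs of the trained base learners, used as features for the meta-learner."""
    return np.column_stack([m.score(X) for m in base_models])

# The meta-learner sees only the base learners' outputs, never the raw features.
meta_model = LogisticModel().fit(base_outputs(X[meta_idx]), y[meta_idx])
y_pred = meta_model.predict(base_outputs(X[test_idx]))
accuracy = float(np.mean(y_pred == y[test_idx]))
print(f"stacked test accuracy: {accuracy:.3f}")
```

Training the meta-learner on data the base learners have not seen is a common precaution against it simply learning to trust overfitted base predictions.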

AutoML involves discovering the data preparation procedure, learning algorithm, and hyperparameters that give the best score for a performance metric in the test harness.

This optimization procedure is normally carried out by a human. The optimization either maximizes a performance metric or minimizes the prediction error.

Experiments

In this section we implement the optimization models using random search and meta-learning. We start with the initial steps of uploading and pre-processing the data.

The first step shows the images of the dataset and imports numpy as np. There are 1500 images in the dataset for experimental purposes. The pre-processing of the data can be done with Keras. The dataset consists of biometric face images. We are using Google Colab for the experimental work.

The next step shows the loading of the converted images after pre-processing. The dataset is now loaded. The biometric images are stored as faces npz. Normalization divides each pixel value of the image by 255, since 255 is the highest pixel value. The images are then loaded together with their labels to form the training and test data.
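The loading and normalization step might look roughly like the sketch below. The array names "images" and "labels", the file name, and the random stand-in data are assumptions, since the actual contents of the assignment's npz file are not shown here.

```python
import os
import tempfile
import numpy as np

def normalize_pixels(images):
    """Convert 8-bit pixel values to floats in [0, 1] by dividing by 255."""
    return images.astype(np.float32) / 255.0

# Hypothetical stand-in data; the key names "images" and "labels" are assumptions.
rng = np.random.default_rng(0)
images = rng.integers(0, 256, size=(10, 8, 8, 1), dtype=np.uint8)
labels = rng.integers(0, 3, size=10)

with tempfile.TemporaryDirectory() as tmp_dir:
    path = os.path.join(tmp_dir, "faces_example.npz")  # hypothetical file name
    np.savez_compressed(path, images=images, labels=labels)
    # Load the arrays back from the .npz archive and normalize the pixels.
    with np.load(path) as data:
        X = normalize_pixels(data["images"])
        y = data["labels"]

print(X.shape, X.dtype, float(X.min()), float(X.max()))
```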