Image Recognition Using Deep Learning
Deep learning can be applied to many image processing and computer vision problems with great success. Using NetChain and NetTrain, you can define and train a neural network that categorizes a handwritten digit given an image.
Obtain training and validation data from the MNIST database of handwritten digits.
In[1]:=
data:image/s3,"s3://crabby-images/8c24d/8c24ddd9b7a79093100fde4d57e985da62323564" alt="Click for copyable input"
resource = ResourceObject["MNIST"];
trainingData = ResourceData[resource, "TrainingData"];
testData = ResourceData[resource, "TestData"];
In[2]:=
data:image/s3,"s3://crabby-images/b187a/b187ad6be86f2f82e77c4a8287584b87793a0416" alt="Click for copyable input"
RandomSample[trainingData, 5]
Out[2]=
data:image/s3,"s3://crabby-images/b28b4/b28b4ab0f8322226552cfc9e86d6235b8f34c5d6" alt=""
Design a convolutional neural network architected for recognizing 28×28 grayscale images.
In[3]:=
data:image/s3,"s3://crabby-images/46c81/46c81eac04898c38bd9e3aad23991514b7bb1b64" alt="Click for copyable input"
lenet = NetChain[
{ConvolutionLayer[20, 5], Ramp, PoolingLayer[2, 2],
ConvolutionLayer[50, 5], Ramp, PoolingLayer[2, 2], FlattenLayer[],
500, Ramp, 10, SoftmaxLayer[]},
"Output" -> NetDecoder[{"Class", Range[0, 9]}],
"Input" -> NetEncoder[{"Image", {28, 28}, "Grayscale"}]
]
Out[3]=
data:image/s3,"s3://crabby-images/66a74/66a74700f1cf06325535d0a8a0dd4a34a554e37e" alt=""
Train the network for three training rounds.
In[4]:=
data:image/s3,"s3://crabby-images/96493/964937d8c29d5691887586dd5075e6752f278e6d" alt="Click for copyable input"
lenet = NetTrain[lenet, trainingData, ValidationSet -> testData,
MaxTrainingRounds -> 3];
Out[4]=
data:image/s3,"s3://crabby-images/82e9b/82e9bf5d507252f793fa5b6e2a8eb57e653dd03e" alt=""
Evaluate the trained network directly on images randomly sampled from the validation set.
In[5]:=
data:image/s3,"s3://crabby-images/8bf7a/8bf7ab99184bd0f9c7c8f86006002e6ad2b87e78" alt="Click for copyable input"
imgs = Keys @ RandomSample[testData, 5];
Thread[imgs -> lenet[imgs]]
Out[5]=
data:image/s3,"s3://crabby-images/31614/316141386b900da7b0cfbfe4f2e6eaab61efba63" alt=""