CNN for Visual Recognition (assignment1_Q1)
Reference: http://cs231n.github.io/assignment1/
Q1: k-Nearest Neighbor classifier (30 points)
import numpy as np

class KNearestNeighbor:
  """ a kNN classifier with L2 distance """

  def __init__(self):
    pass

  def train(self, X, y):
    """
    Train the classifier. For k-nearest neighbors this is just
    memorizing the training data.

    Input:
    X - A num_train x dimension array where each row is a training point.
    y - A vector of length num_train, where y[i] is the label for X[i, :]
    """
    self.X_train = X
    self.y_train = y

  def predict(self, X, k=1, num_loops=0):
"""
Predict labels for test data using this classifier. Input:
X - A num_test x dimension array where each row is a test point.
k - The number of nearest neighbors that vote for predicted label
num_loops - Determines which method to use to compute distances
between training points and test points. Output:
y - A vector of length num_test, where y[i] is the predicted label for the
test point X[i, :].
"""
if num_loops == 0:
dists = self.compute_distances_no_loops(X)
elif num_loops == 1:
dists = self.compute_distances_one_loop(X)
elif num_loops == 2:
dists = self.compute_distances_two_loops(X)
else:
raise ValueError('Invalid value %d for num_loops' % num_loops) return self.predict_labels(dists, k=k) def compute_distances_two_loops(self, X):
"""
Compute the distance between each test point in X and each training point
in self.X_train using a nested loop over both the training data and the
test data. Input:
X - An num_test x dimension array where each row is a test point. Output:
dists - A num_test x num_train array where dists[i, j] is the distance
between the ith test point and the jth training point.
"""
num_test = X.shape[0]
num_train = self.X_train.shape[0]
dists = np.zeros((num_test, num_train))
for i in xrange(num_test):
for j in xrange(num_train):
#####################################################################
# TODO: #
# Compute the l2 distance between the ith test point and the jth #
# training point, and store the result in dists[i, j] #
#####################################################################
dists[i,j] = np.sqrt(np.sum(np.square(X[i,:] - self.X_train[j,:])))
#####################################################################
# END OF YOUR CODE #
#####################################################################
return dists def compute_distances_one_loop(self, X):
"""
Compute the distance between each test point in X and each training point
in self.X_train using a single loop over the test data. Input / Output: Same as compute_distances_two_loops
"""
num_test = X.shape[0]
num_train = self.X_train.shape[0]
dists = np.zeros((num_test, num_train))
for i in xrange(num_test):
#######################################################################
# TODO: #
# Compute the l2 distance between the ith test point and all training #
# points, and store the result in dists[i, :]. #
#######################################################################
dists[i, :] = np.sqrt(np.sum(np.square(self.X_train - X[i,:]), axis=1))
#######################################################################
# END OF YOUR CODE #
#######################################################################
return dists def compute_distances_no_loops(self, X):
"""
Compute the distance between each test point in X and each training point
in self.X_train using no explicit loops. Input / Output: Same as compute_distances_two_loops
"""
num_test = X.shape[0]
num_train = self.X_train.shape[0]
dists = np.zeros((num_test, num_train))
#########################################################################
# TODO: #
# Compute the l2 distance between all test points and all training #
# points without using any explicit loops, and store the result in #
# dists. #
# HINT: Try to formulate the l2 distance using matrix multiplication #
# and two broadcast sums. #
#########################################################################
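    # Vectorization note: the squared L2 distance expands as
    #   ||x_i - t_j||^2 = ||x_i||^2 - 2 * x_i . t_j + ||t_j||^2,
    # so the whole num_test x num_train matrix can be assembled from one
    # matrix product (the cross terms) plus two broadcast sums of squared norms.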
    tDot = np.multiply(np.dot(X, self.X_train.T), -2)
    t1 = np.sum(np.square(X), axis=1, keepdims=True)
    t2 = np.sum(np.square(self.X_train), axis=1)
    tDot = np.add(t1, tDot)
    tDot = np.add(tDot, t2)
    dists = np.sqrt(tDot)
    #####################################################################
    #                        END OF YOUR CODE
    #####################################################################
    return dists

  def predict_labels(self, dists, k=1):
"""
Given a matrix of distances between test points and training points,
predict a label for each test point. Input:
dists - A num_test x num_train array where dists[i, j] gives the distance
between the ith test point and the jth training point. Output:
y - A vector of length num_test where y[i] is the predicted label for the
ith test point.
"""
num_test = dists.shape[0]
y_pred = np.zeros(num_test)
for i in xrange(num_test):
# A list of length k storing the labels of the k nearest neighbors to
# the ith test point.
closest_y = []
#########################################################################
# TODO: #
# Use the distance matrix to find the k nearest neighbors of the ith #
# training point, and use self.y_train to find the labels of these #
# neighbors. Store these labels in closest_y. #
# Hint: Look up the function numpy.argsort. #
#########################################################################
# pass
closest_y = self.y_train[np.argsort(dists[i, :])[:k]]
#########################################################################
# TODO: #
# Now that you have found the labels of the k nearest neighbors, you #
# need to find the most common label in the list closest_y of labels. #
# Store this label in y_pred[i]. Break ties by choosing the smaller #
# label. #
######################################################################### y_pred[i] = np.argmax(np.bincount(closest_y))
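      # np.bincount tallies the votes for each label value among the k
      # neighbors, and np.argmax returns the first index holding the maximal
      # count, so ties are indeed broken toward the smaller label.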
      #####################################################################
      #                        END OF YOUR CODE
      #####################################################################
    return y_pred
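The timings reported below were presumably measured with a setup along the lines of the standard kNN notebook: load CIFAR-10, flatten each image into a 3072-dimensional row, subsample to 5,000 training and 500 test points, and time each distance routine. The sketch below assumes the assignment's load_CIFAR10 helper from cs231n.data_utils; the dataset path and the small time_it helper are placeholders, not part of the original code.

import time
import numpy as np
from cs231n.data_utils import load_CIFAR10  # assignment helper for reading the CIFAR-10 batches

def time_it(f, *args):
  """Return the wall-clock time in seconds that f(*args) takes."""
  tic = time.time()
  f(*args)
  return time.time() - tic

# Load CIFAR-10, flatten each image into a row vector, and subsample.
X_train, y_train, X_test, y_test = load_CIFAR10('cs231n/datasets/cifar-10-batches-py')
X_train = np.reshape(X_train, (X_train.shape[0], -1))[:5000]
y_train = y_train[:5000]
X_test = np.reshape(X_test, (X_test.shape[0], -1))[:500]
y_test = y_test[:500]

classifier = KNearestNeighbor()
classifier.train(X_train, y_train)

print('Two loop version took %f seconds' % time_it(classifier.compute_distances_two_loops, X_test))
print('One loop version took %f seconds' % time_it(classifier.compute_distances_one_loop, X_test))
print('No loop version took %f seconds' % time_it(classifier.compute_distances_no_loops, X_test))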
Output:
Two loop version took 55.817642 seconds
One loop version took 49.692089 seconds
No loop version took 1.267753 seconds
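The ranking is expected: the no-loop version builds the entire 500 x 5000 distance matrix from a single matrix multiplication plus two broadcast sums, all executed in optimized native code, while the looped versions pay Python-level interpreter overhead for every test point (and, in the two-loop case, for every test/train pair). With the distances in hand, prediction and accuracy follow directly; the snippet below is again a sketch that reuses the names from the setup above.

# Vote among the k = 5 nearest neighbors using the fully vectorized distances.
y_test_pred = classifier.predict(X_test, k=5, num_loops=0)
num_correct = np.sum(y_test_pred == y_test)
print('Got %d / %d correct => accuracy: %f'
      % (num_correct, len(y_test), float(num_correct) / len(y_test)))

On raw CIFAR-10 pixels this lands well above the 10% chance level, but far below what the learned classifiers later in the course reach, which is the motivation for moving on from kNN.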