吴恩达+neural-networks-deep-learning+第二周作业

Logistic Regression with a Neural Network mindset v4

简单用logistic实现了猫的识别，logistic可以被看做一个简单的神经网络结构，下面是主要代码：

import numpy as np

import matplotlib.pyplot as plt

import h5py

import scipy

from PIL import Image

from scipy import ndimage

from lr_utils import load_dataset

%matplotlib inline

### START CODE HERE ### (≈ 3 lines of code)

m_train = train_set_x_orig.shape[0]

m_test = test_set_x_orig.shape[0]

num_px = train_set_x_orig.shape[1]

### END CODE HERE ###

print ("Number of training examples: m_train = " + str(m_train))

print ("Number of testing examples: m_test = " + str(m_test))

print ("Height/Width of each image: num_px = " + str(num_px))

print ("Each image is of size: (" + str(num_px) + ", " + str(num_px) + ", 3)")

print ("train_set_x shape: " + str(train_set_x_orig.shape))

print ("train_set_y shape: " + str(train_set_y.shape))

print ("test_set_x shape: " + str(test_set_x_orig.shape))

print ("test_set_y shape: " + str(test_set_y.shape))

3.数据预处理过程

# Reshape the training and test examples

### START CODE HERE ### (≈ 2 lines of code)

train_set_x_flatten = train_set_x_orig.reshape(-1,train_set_x_orig.shape[1]*train_set_x_orig.shape[2]*3).T

test_set_x_flatten = test_set_x_orig.reshape(-1,test_set_x_orig.shape[1]*test_set_x_orig.shape[2]*3).T

### END CODE HERE ###

print ("train_set_x_flatten shape: " + str(train_set_x_flatten.shape))

print ("train_set_y shape: " + str(train_set_y.shape))

print ("test_set_x_flatten shape: " + str(test_set_x_flatten.shape))

print ("test_set_y shape: " + str(test_set_y.shape))

print ("sanity check after reshaping: " + str(train_set_x_flatten[0:5,0]))

注意：此处，不可用（num_px*num_px*3 ，-1），因为reshape默认以行分割，就是说我在确定一个reshape之后（M,N）现在我读取原数组按行读取，写入数组的时候也是按行写入的，所以我原数组的行是一幅图像，那么reshape数组的行也应该是一个图像，所以要写成，train_set_x_orig.reshape(-1,train_set_x_orig.shape[1]*train_set_x_orig.shape[2]*3)，而不是把样本数量当做行，那就乱了！

train_set_x = train_set_x_flatten/255.

test_set_x = test_set_x_flatten/255.

def propagate(w, b, X, Y):

    """

    Implement the cost function and its gradient for the propagation explained above

    Arguments:

    w -- weights, a numpy array of size (num_px * num_px * 3, 1)

    b -- bias, a scalar

    X -- data of size (num_px * num_px * 3, number of examples)

    Y -- true "label" vector (containing 0 if non-cat, 1 if cat) of size (1, number of examples)

    Return:

    cost -- negative log-likelihood cost for logistic regression

    dw -- gradient of the loss with respect to w, thus same shape as w

    db -- gradient of the loss with respect to b, thus same shape as b

    Tips:

    - Write your code step by step for the propagation. np.log(), np.dot()

    """

    m = X.shape[1]

    # FORWARD PROPAGATION (FROM X TO COST)

    ### START CODE HERE ### (≈ 2 lines of code)

    A = sigmoid(np.dot(w.T,X)+b)                                    # compute activation

    cost = -1/m*((np.dot(Y,np.log(A).T))+(np.dot(1-Y,np.log(1-A).T)))                                 # compute cost

    ### END CODE HERE ###

    # BACKWARD PROPAGATION (TO FIND GRAD)

    ### START CODE HERE ### (≈ 2 lines of code)

    dw = 1/m*np.dot(X,(A-Y).T)

    db = 1/m*np.sum(A-Y)

    ### END CODE HERE ###

    assert(dw.shape == w.shape)

    assert(db.dtype == float)

    cost = np.squeeze(cost)

    assert(cost.shape == ())

    grads = {"dw": dw,

             "db": db}

    return grads, cost

# GRADED FUNCTION: optimize

def optimize(w, b, X, Y, num_iterations, learning_rate, print_cost = False):

    """

    This function optimizes w and b by running a gradient descent algorithm

    Arguments:

    w -- weights, a numpy array of size (num_px * num_px * 3, 1)

    b -- bias, a scalar

    X -- data of shape (num_px * num_px * 3, number of examples)

    Y -- true "label" vector (containing 0 if non-cat, 1 if cat), of shape (1, number of examples)

    num_iterations -- number of iterations of the optimization loop

    learning_rate -- learning rate of the gradient descent update rule

    print_cost -- True to print the loss every 100 steps

    Returns:

    params -- dictionary containing the weights w and bias b

    grads -- dictionary containing the gradients of the weights and bias with respect to the cost function

    costs -- list of all the costs computed during the optimization, this will be used to plot the learning curve.

    Tips:

    You basically need to write down two steps and iterate through them:

        1) Calculate the cost and the gradient for the current parameters. Use propagate().

        2) Update the parameters using gradient descent rule for w and b.

    """

    costs = []

    for i in range(num_iterations):

        # Cost and gradient calculation (≈ 1-4 lines of code)

        ### START CODE HERE ###

        grads, cost = propagate(w,b,X,Y)

        ### END CODE HERE ###

        # Retrieve derivatives from grads

        dw = grads["dw"]

        db = grads["db"]

        # update rule (≈ 2 lines of code)

        ### START CODE HERE ###

        w = w-learning_rate*dw

        b = b-learning_rate*db

        ### END CODE HERE ###

        # Record the costs

        if i % 100 == 0:

            costs.append(cost)

        # Print the cost every 100 training examples

        if print_cost and i % 100 == 0:

            print ("Cost after iteration %i: %f" %(i, cost))

    params = {"w": w,

              "b": b}

    grads = {"dw": dw,

             "db": db}

    return params, grads, costs

# GRADED FUNCTION: predict

def predict(w, b, X):

    '''

    Predict whether the label is 0 or 1 using learned logistic regression parameters (w, b)

    Arguments:

    w -- weights, a numpy array of size (num_px * num_px * 3, 1)

    b -- bias, a scalar

    X -- data of size (num_px * num_px * 3, number of examples)

    Returns:

    Y_prediction -- a numpy array (vector) containing all predictions (0/1) for the examples in X

    '''

    m = X.shape[1]

    Y_prediction = np.zeros((1,m))

    w = w.reshape(X.shape[0], 1)

    # Compute vector "A" predicting the probabilities of a cat being present in the picture

    ### START CODE HERE ### (≈ 1 line of code)

    A = sigmoid(np.dot(w.T,X)+b)

    ### END CODE HERE ###

    #########

    Y_prediction=A>0.5

    Y_prediction=Y_prediction.astype(float)

    #########

    for i in range(A.shape[1]):

        # Convert probabilities A[0,i] to actual predictions p[0,i]

        ### START CODE HERE ### (≈ 4 lines of code)

        pass

        ### END CODE HERE ###

    assert(Y_prediction.shape == (1, m))

    return Y_prediction

用了一个向量化解决了循环问题，很开心！

# GRADED FUNCTION: model

def model(X_train, Y_train, X_test, Y_test, num_iterations = 2000, learning_rate = 0.5, print_cost = False):

    """

    Builds the logistic regression model by calling the function you've implemented previously

    Arguments:

    X_train -- training set represented by a numpy array of shape (num_px * num_px * 3, m_train)

    Y_train -- training labels represented by a numpy array (vector) of shape (1, m_train)

    X_test -- test set represented by a numpy array of shape (num_px * num_px * 3, m_test)

    Y_test -- test labels represented by a numpy array (vector) of shape (1, m_test)

    num_iterations -- hyperparameter representing the number of iterations to optimize the parameters

    learning_rate -- hyperparameter representing the learning rate used in the update rule of optimize()

    print_cost -- Set to true to print the cost every 100 iterations

    Returns:

    d -- dictionary containing information about the model.

    """

    ### START CODE HERE ###

    # initialize parameters with zeros (≈ 1 line of code)

    w, b = initialize_with_zeros(X_train.shape[0])

    # Gradient descent (≈ 1 line of code)

    parameters, grads, costs = optimize(w, b , X_train , Y_train , num_iterations , learning_rate , print_cost = False)

    # Retrieve parameters w and b from dictionary "parameters"

    w = parameters["w"]

    b = parameters["b"]

    # Predict test/train set examples (≈ 2 lines of code)

    Y_prediction_test = predict(w,b,X_test)

    Y_prediction_train = predict(w,b,X_train)

    ### END CODE HERE ###

    # Print train/test Errors

    print("train accuracy: {} %".format(100 - np.mean(np.abs(Y_prediction_train - Y_train)) * 100))

    print("test accuracy: {} %".format(100 - np.mean(np.abs(Y_prediction_test - Y_test)) * 100))

    d = {"costs": costs,

         "Y_prediction_test": Y_prediction_test,

         "Y_prediction_train" : Y_prediction_train,

         "w" : w,

         "b" : b,

         "learning_rate" : learning_rate,

         "num_iterations": num_iterations}

    print(d["costs"])

    return d

如果3的代码写反了，就变成34%的预测结果了，所以千万要注意细节！

吴恩达+neural-networks-deep-learning+第二周作业的更多相关文章

cousera 吴恩达深度学习第一课第二周作业过拟合的表现
上图是课上的编程作业运行10000次迭代后,输出每一百次迭代训练准确度和测试准确度的走势图,可以看到在600代左右测试准确度为最大的,74%左右, 然后掉到70%左右,再掉到68%左右,然后升到70 ...
(Deep) Neural Networks (Deep Learning) , NLP and Text Mining
(Deep) Neural Networks (Deep Learning) , NLP and Text Mining 最近翻了一下关于Deep Learning 或者普通的Neural Netw ...
Github | 吴恩达新书《Machine Learning Yearning》完整中文版开源
最近开源了周志华老师的西瓜书<机器学习>纯手推笔记: 博士笔记 | 周志华<机器学习>手推笔记第一章思维导图 [博士笔记 | 周志华<机器学习>手推笔记第二章&qu ...
吴恩达《深度学习》-第一门课 (Neural Networks and Deep Learning)-第二周：(Basics of Neural Network programming)-课程笔记
第二周:神经网络的编程基础 (Basics of Neural Network programming) 2.1.二分类(Binary Classification) 二分类问题的目标就是习得一个分类 ...
吴恩达《深度学习》-第二门课 (Improving Deep Neural Networks:Hyperparameter tuning, Regularization and Optimization)-第一周：深度学习的实践层面 (Practical aspects of Deep Learning) -课程笔记
第一周:深度学习的实践层面 (Practical aspects of Deep Learning) 1.1 训练,验证,测试集(Train / Dev / Test sets) 创建新应用的过程中, ...
吴恩达 Deep learning 第二周神经网络基础
逻辑回归代价函数(损失函数)的几个求导特性 1.对于sigmoid函数 2.对于以下函数 3.线性回归与逻辑回归的神经网络图表示利用Numpy向量化运算与for循环运算的显著差距 import nu ...
吴恩达《深度学习》第二门课（3）超参数调试、Batch正则化和程序框架
3.1调试处理 (1)不同超参数调试的优先级是不一样的,如下图中的一些超参数,首先最重要的应该是学习率α(红色圈出),然后是Momentum算法的β.隐藏层单元数.mini-batch size(黄色 ...
笔记 | 吴恩达新书《Machine Learning Yearning》
这本书共112页,内容不多,偏向于工程向,有很多不错的细节,在此记录一下. 0 书籍获取关注微信公众号"机器学习炼丹术",回复[MLY]获取pdf 1 测试集与训练集的比例 2 ...
吴恩达(Andrew Ng)——机器学习笔记1
之前经学长推荐,开始在B站上看Andrew Ng的机器学习课程.其实已经看了1/3了吧,今天把学习笔记补上吧. 吴恩达老师的Machine learning课程共有113节(B站上的版本https:/ ...
吴恩达《深度学习》-课后测验-第一门课 (Neural Networks and Deep Learning)-Week 2 - Neural Network Basics（第二周测验 - 神经网络基础）
Week 2 Quiz - Neural Network Basics(第二周测验 - 神经网络基础) 1. What does a neuron compute?(神经元节点计算什么?) [ ] A ...

随机推荐

【Python开发】Python 适合大数据量的处理吗？
Python 适合大数据量的处理吗? python 能处理数据库中百万行级的数据吗? 处理大规模数据时有那些常用的python库,他们有什么优缺点?适用范围如何? 需要澄清两点之后才可以比较全面的看这 ...
笔记本通过命令配置wifi win7系统
查看本子是否支持承载网络在开始菜单>附件>命令提示符(右键点击:以管理员身份运行) 命令行中输入以下内容,找到[支持的承载网络]这一行,如果为"是"就OK了,表示支持 ...
Palindromic Substrings
Given a string, your task is to count how many palindromic substrings in this string. The substrings ...
ORA-00911: invalid character解决方法
今天在搭建VLS系统后,登录系统测试时发现点击菜单提示错误“ORA-00911:???”.网上很多是因为语句中带分号导致的,但是这次是点开菜单就报错,怀疑是字符集设置的问题. 参考网上的解决方案,添加 ...
Python学习【day02】- Python基础练习题
#!/usr/bin/env python # -*- coding:utf8 -*- # 执行Python 脚本的两种方式 # 答:①在windows的cmd窗口下 > D:/Python/p ...
selenium与页面交互
selenium提供了许多API方法与页面进行交互,如点击.键盘输入.打开关闭网页.输入文字等. 一.webdriver对浏览器提供了很多属性来对浏览器进行操作,常用的如图: get(url).qui ...
Zabbix-自带监控项与Kye
开启主机自动发现 ⦁选择配置 ->自动发现 ->创建自动发现->ip范围必须连续,不连续的话逗号分开 ⦁更新间隔 -> 2s ->更新 ⦁检查 -> 选择新的 -& ...
css背景图自适应全屏显示
前几天我在写一个前端页面的时候,需要用到全屏背景图,但是怎么写都不行(要么不全屏,要么不兼容Bootstrap的响应式布局).对,是我腊鸡后来我在网上找的时候找到一个大神写的笔记,参(照)考(抄)之 ...
关于tomcat部署项目的问题
问题是这样的之前用tomcat8.5部署的项目,结果启动项目一直三个端口被占用,浏览器也打不开目标网页卸了8,装了9.先配置的一大堆,结果可以打开Tomcat的主页locahost:8080,到此 ...
Get MySQL这5个优化技巧
一个成熟的数据库架构并不是一开始设计就具备高可用.高伸缩等特性的,它是随着用户量的增加,基础架构才逐渐完善.这篇文章主要谈谈MySQL数据库在发展周期中所面临的问题及优化方案,暂且抛开前端应用不说,大 ...

吴恩达+neural-networks-deep-learning+第二周作业

吴恩达+neural-networks-deep-learning+第二周作业的更多相关文章

随机推荐

热门专题