This can be solved by directly applying the formulas from the lectures.
Exercise 1
Complete the compute_cost function below to:
- Iterate over the training examples, and for each example, compute:
  - The prediction of the model for that example
    $$f_{wb}(x^{(i)}) = wx^{(i)} + b$$
  - The cost for that example
    $$cost^{(i)} = (f_{wb}(x^{(i)}) - y^{(i)})^2$$
- Return the total cost over all examples
  $$J(w,b) = \frac{1}{2m} \sum\limits_{i = 0}^{m-1} cost^{(i)}$$
- Here, $m$ is the number of training examples and $\sum$ is the summation operator
Code:
def compute_cost(x, y, w, b):
    """
    Computes the cost function for linear regression.
    Args:
        x (ndarray): Shape (m,) Input to the model (Population of cities)
        y (ndarray): Shape (m,) Label (Actual profits for the cities)
        w, b (scalar): Parameters of the model
    Returns
        total_cost (float): The cost of using w,b as the parameters for linear regression
               to fit the data points in x and y
    """
    # number of training examples
    m = x.shape[0]

    # You need to return this variable correctly
    total_cost = 0

    ### START CODE HERE ###
    for i in range(m):
        f_wb = w * x[i] + b              # prediction for example i
        total_cost += (f_wb - y[i]) ** 2
    total_cost = total_cost / (2 * m)
    ### END CODE HERE ###

    return total_cost
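As a quick sanity check (a sketch; NumPy is assumed to be available, and the tiny dataset below is made up for illustration), the loop form of the cost can be compared against an equivalent vectorized NumPy expression:

```python
import numpy as np

# Hypothetical toy data: populations and profits with y = 2x exactly
x = np.array([1.0, 2.0, 3.0, 4.0])
y = np.array([2.0, 4.0, 6.0, 8.0])
w, b = 2.0, 0.0  # parameters that fit this data perfectly

# Loop form, mirroring compute_cost above
m = x.shape[0]
total_cost = 0
for i in range(m):
    f_wb = w * x[i] + b
    total_cost += (f_wb - y[i]) ** 2
total_cost = total_cost / (2 * m)

# Vectorized form: the same quantity in one NumPy expression
vectorized = np.sum((w * x + b - y) ** 2) / (2 * m)

print(total_cost, vectorized)  # both are 0.0 for a perfect fit
```

Both forms compute the same $J(w,b)$; for a perfect fit the cost is zero, and any other $(w, b)$ gives a strictly positive cost.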
Exercise 2
Please complete the compute_gradient function to:
- Iterate over the training examples, and for each example, compute:
  - The prediction of the model for that example
    $$f_{wb}(x^{(i)}) = wx^{(i)} + b$$
  - The gradient for the parameters $w, b$ from that example
    $$\frac{\partial J(w,b)}{\partial b}^{(i)} = (f_{w,b}(x^{(i)}) - y^{(i)})$$
    $$\frac{\partial J(w,b)}{\partial w}^{(i)} = (f_{w,b}(x^{(i)}) - y^{(i)})x^{(i)}$$
- Return the total gradient update from all the examples
  $$\frac{\partial J(w,b)}{\partial b} = \frac{1}{m} \sum\limits_{i = 0}^{m-1} \frac{\partial J(w,b)}{\partial b}^{(i)}$$
  $$\frac{\partial J(w,b)}{\partial w} = \frac{1}{m} \sum\limits_{i = 0}^{m-1} \frac{\partial J(w,b)}{\partial w}^{(i)}$$
- Here, $m$ is the number of training examples and $\sum$ is the summation operator
Code:
def compute_gradient(x, y, w, b):
    """
    Computes the gradient for linear regression
    Args:
        x (ndarray): Shape (m,) Input to the model (Population of cities)
        y (ndarray): Shape (m,) Label (Actual profits for the cities)
        w, b (scalar): Parameters of the model
    Returns
        dj_dw (scalar): The gradient of the cost w.r.t. the parameter w
        dj_db (scalar): The gradient of the cost w.r.t. the parameter b
    """
    # Number of training examples
    m = x.shape[0]

    # You need to return the following variables correctly
    dj_dw = 0
    dj_db = 0

    ### START CODE HERE ###
    for i in range(m):
        err = (w * x[i] + b) - y[i]   # prediction error for example i
        dj_db += err
        dj_dw += err * x[i]
    dj_dw = dj_dw / m
    dj_db = dj_db / m
    ### END CODE HERE ###

    return dj_dw, dj_db
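To see the two functions working together, the sketch below (NumPy assumed; the toy data and the learning rate alpha are made-up illustration values, not part of the exercise) runs plain gradient descent using the same per-example gradient logic, so that $(w, b)$ moves toward the values that minimize the cost:

```python
import numpy as np

def compute_gradient(x, y, w, b):
    # Same logic as the function above: accumulate per-example
    # gradients, then average over the m examples
    m = x.shape[0]
    dj_dw, dj_db = 0.0, 0.0
    for i in range(m):
        err = (w * x[i] + b) - y[i]
        dj_db += err
        dj_dw += err * x[i]
    return dj_dw / m, dj_db / m

# Hypothetical toy data generated from y = 2x, so the optimum is w = 2, b = 0
x = np.array([1.0, 2.0, 3.0, 4.0])
y = np.array([2.0, 4.0, 6.0, 8.0])

w, b = 0.0, 0.0
alpha = 0.1  # learning rate (illustration value)
for _ in range(2000):
    dj_dw, dj_db = compute_gradient(x, y, w, b)
    # Update both parameters simultaneously from the same gradients
    w = w - alpha * dj_dw
    b = b - alpha * dj_db

print(w, b)  # approaches w = 2.0, b = 0.0
```

Note that both gradients are computed from the current $(w, b)$ before either parameter is updated; updating $w$ first and then computing $\partial J/\partial b$ from the new $w$ would be a subtle but common bug.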