C:\Users\Administrator\anaconda3\envs\yolo111\python.exe C:\Users\Administrator\Desktop\ultralytics-main\ce_shi.py C:\Users\Administrator\Desktop\ultralytics-main\ce_shi.py:32: VisibleDeprecationWarning: Creating an ndarray from ragged nested sequences (which is a list-or-tuple of lists-or-tuples-or ndarrays with different lengths or shapes) is deprecated. If you meant to do this, you must specify 'dtype=object' when creating the ndarray. return np.array(images), np.array(labels) ✅ 加载完成：94张图片, 4个类别 ⏳ 提取特征中... C:\Users\Administrator\Desktop\ultralytics-main\ce_shi.py:47: VisibleDeprecationWarning: Creating an ndarray from ragged nested sequences (which is a list-or-tuple of lists-or-tuples-or ndarrays with different lengths or shapes) is deprecated. If you meant to do this, you must specify 'dtype=object' when creating the ndarray. return np.array(features) TypeError: only size-1 arrays can be converted to Python scalars The above exception was the direct cause of the following exception: Traceback (most recent call last): File "C:\Users\Administrator\Desktop\ultralytics-main\ce_shi.py", line 95, in <module> main() File "C:\Users\Administrator\Desktop\ultralytics-main\ce_shi.py", line 66, in main X_train = scaler.fit_transform(X_train) File "C:\Users\Administrator\anaconda3\envs\yolo111\lib\site-packages\sklearn\utils\_set_output.py", line 157, in wrapped data_to_wrap = f(self, X, args, kwargs) File "C:\Users\Administrator\anaconda3\envs\yolo111\lib\site-packages\sklearn\base.py", line 916, in fit_transform return self.fit(X, fit_params).transform(X) File "C:\Users\Administrator\anaconda3\envs\yolo111\lib\site-packages\sklearn\preprocessing\_data.py", line 839, in fit return self.partial_fit(X, y, sample_weight) File "C:\Users\Administrator\anaconda3\envs\yolo111\lib\site-packages\sklearn\base.py", line 1152, in wrapper return fit_method(estimator, args, kwargs) File "C:\Users\Administrator\anaconda3\envs\yolo111\lib\site-packages\sklearn\preprocessing\_data.py", line 875, in partial_fit X = self._validate_data( File "C:\Users\Administrator\anaconda3\envs\yolo111\lib\site-packages\sklearn\base.py", line 605, in _validate_data out = check_array(X, input_name="X", check_params) File "C:\Users\Administrator\anaconda3\envs\yolo111\lib\site-packages\sklearn\utils\validation.py", line 915, in check_array array = _asarray_with_order(array, order=order, dtype=dtype, xp=xp) File "C:\Users\Administrator\anaconda3\envs\yolo111\lib\site-packages\sklearn\utils\_array_api.py", line 380, in _asarray_with_order array = numpy.asarray(array, order=order, dtype=dtype) ValueError: setting an array element with a sequence.

时间: 2025-08-03 16:13:55 浏览: 7

<think>我们面对的问题主要有两个：1.在创建NumPy数组时，如果嵌套序列的形状不一致，会触发`VisibleDeprecationWarning`，并且最终得到的数组将是对象数组（`dtype=object`）而不是常规的数值数组。2.当尝试将这样的对象数组传递给`StandardScaler`等scikit-learn工具时，会抛出`ValueError:settinganarrayelementwithasequence`错误，因为它们要求输入是二维数值数组。原因分析：-在您的数据中，每个样本的特征可能是一个向量（例如HOG特征向量），但每个样本的特征向量长度必须相同。如果长度不同，则无法组成一个二维数组（形状为(n_samples,n_features)的矩阵），而是得到一个对象数组，其中每个元素是一个一维数组（但长度不同）或列表。解决方案：1.确保每个样本提取的特征向量长度相同。检查特征提取过程，确保所有样本经过相同的处理。2.如果特征提取过程中出现了不一致的特征长度，需要修正特征提取方法。例如，在图像处理中，如果使用HOG特征，需要确保所有图像调整到相同尺寸，这样HOG特征维度才会一致。3.使用`np.vstack`或`np.stack`等函数时，也要确保维度匹配。以下是具体步骤和修改后的代码片段：###步骤1：检查特征提取过程在您的代码中，提取HOG特征之前，确保所有图像被调整为相同大小。如果原始图像尺寸不一致，就会导致HOG特征向量长度不同。###修改后的特征提取函数```pythondefextract_features(images,target_size=(128,64)):"""提取HOG特征，并先将图像调整为统一尺寸"""features=[]forimginimages:#如果图像是彩色，转为灰度iflen(img.shape)>2:img=np.mean(img,axis=2).astype(np.uint8)#调整图像尺寸至统一大小img=transform.resize(img,target_size,mode='reflect',anti_aliasing=True)#提取HOG特征，注意：这里我们使用固定的像素块大小hog_features=hog(img,orientations=8,pixels_per_cell=(16,16),cells_per_block=(1,1),visualize=False,channel_axis=None)features.append(hog_features)returnnp.array(features)#返回一个二维数组（n_samples,n_features）```###步骤2：验证特征数组形状在提取特征后，检查特征数组的形状：```pythonX=extract_features(images)print("特征数组形状:",X.shape)#应为(n_samples,n_features)```如果所有图像都被调整为相同大小，那么每个图像提取的HOG特征维度应该相同，这样`np.array(features)`将生成一个二维数组。###步骤3：避免使用不一致的列表如果由于某种原因无法调整图像至相同大小（例如您想保留原始比例），可以考虑使用其他方法使特征维度相同，比如：-将图像调整为相同比例后裁剪（centercrop）-使用图像填充（padding）至相同尺寸-使用空间金字塔（SpatialPyramid）等方法来获得固定长度的特征向量###完整修改后的代码以下是修改后的完整代码，特别注意在提取HOG特征之前调整图像大小：```pythonimportosimportjoblibfromskimageimportio,transformfromskimage.featureimporthogimportnumpyasnpfromsklearn.model_selectionimporttrain_test_splitfromsklearn.preprocessingimportStandardScalerfromsklearn.svmimportSVCfromsklearn.metricsimportaccuracy_scoredefload_dataset(data_dir):"""加载数据集"""images=[]labels=[]class_dirs=[dfordinos.listdir(data_dir)ifos.path.isdir(os.path.join(data_dir,d))]forclass_dirinclass_dirs:class_path=os.path.join(data_dir,class_dir)class_label=class_dirforimg_fileinos.listdir(class_path):ifimg_file.endswith('.jpg'):img_path=os.path.join(class_path,img_file)try:img=io.imread(img_path)images.append(img)labels.append(class_label)exceptExceptionase:print(f"无法读取图像{img_path}:{e}")returnnp.array(images),np.array(labels)defextract_features(images,target_size=(128,64)):"""提取HOG特征，先将图像调整为统一尺寸"""features=[]forimginimages:#如果图像是彩色，转为灰度iflen(img.shape)>2:img=np.mean(img,axis=2).astype(np.uint8)#调整图像尺寸至统一大小img=transform.resize(img,target_size,mode='reflect',anti_aliasing=True)#提取HOG特征hog_features=hog(img,orientations=8,pixels_per_cell=(16,16),cells_per_block=(1,1),visualize=False,channel_axis=None)features.append(hog_features)returnnp.array(features)defmain():#1.加载数据集DATA_DIR='data'#数据集路径images,labels=load_dataset(DATA_DIR)print(f"✅加载完成：{len(images)}张图片,{len(np.unique(labels))}个类别")#2.提取特征print("⏳提取特征中...")X=extract_features(images)print(f"特征矩阵形状:{X.shape}")#打印形状y=labels#3.划分训练集和测试集X_train,X_test,y_train,y_test=train_test_split(X,y,test_size=0.2,random_state=42)#4.数据标准化scaler=StandardScaler()X_train=scaler.fit_transform(X_train)#此时输入必须是二维数值数组X_test=scaler.transform(X_test)#5.训练SVM模型print("⏳训练SVM模型中...")model=SVC(kernel='rbf',C=1.0,gamma='scale',probability=True)model.fit(X_train,y_train)#6.评估模型y_pred=model.predict(X_test)accuracy=accuracy_score(y_test,y_pred)print(f"🎯模型准确率:{accuracy:.2%}")#7.保存模型到项目根目录try:PROJECT_ROOT=os.path.abspath(os.path.join(os.path.dirname(__file__),'..'))os.makedirs(PROJECT_ROOT,exist_ok=True)#保存模型和标准化器joblib.dump(model,os.path.join(PROJECT_ROOT,"vehicle_classifier.joblib"))joblib.dump(scaler,os.path.join(PROJECT_ROOT,"scaler.joblib"))print(f"✅模型已保存到:{os.path.join(PROJECT_ROOT,'vehicle_classifier.joblib')}")print(f"✅标准化器已保存到:{os.path.join(PROJECT_ROOT,'scaler.joblib')}")exceptExceptionase:print(f"❌保存失败:{str(e)}")if__name__=="__main__":main()```###关键点-`transform.resize`将图像调整为固定尺寸（如128x64），确保HOG特征维度一致。-图像转灰度：通过取RGB通道的平均值实现。-特征数组的维度：调整大小后，HOG特征维度将固定。例如，对于128x64的图像，使用`pixels_per_cell=(16,16)`，则HOG特征向量长度为：$$\text{宽度方向块数}=\frac{128}{16}=8\\\text{高度方向块数}=\frac{64}{16}=4\\\text{特征长度}=8\times4\times\text{orientations}\times\text{每个块在block中的细胞数}\\\text{此处cells_per_block=(1,1)，即无块归一化，所以特征长度}=8\times4\times8=256$$这样，每个样本的特征向量长度都是256，可以组成二维数组。如果仍然遇到问题，请检查图像列表`images`中是否有图像形状异常（比如非常小或损坏），可以在`transform.resize`前打印图像形状。###相关问题1.在图像特征提取中，除了HOG，还有哪些方法可以处理不同尺寸的图像？[^1]2.使用`StandardScaler`标准化数据时，如果数据包含NaN或无穷值，会发生什么？如何解决？[^2]3.如何解释HOG特征向量的维度计算公式？[^3]4.当样本的特征长度仍然不一致时，是否有其他方法（如特征降维）可以将其统一？[^4]希望这些内容能帮助您解决问题！</think> 要解决NumPy中创建ndarray时的`VisibleDeprecationWarning`警告和scikit-learn中的`ValueError: setting an array element with a sequence`错误，需要确保所有输入数据具有统一的形状和维度。以下是完整的解决方案： ### 完整修改代码 (修复数组形状问题) ```python import numpy as np from sklearn.preprocessing import StandardScaler # 修复嵌套序列转换问题 def safe_array_conversion(data): """将形状不一致的嵌套序列安全转换为NumPy数组""" # 步骤1: 检查所有子序列长度是否一致 lengths = [len(item) if isinstance(item, (list, np.ndarray)) else 1 for item in data] if len(set(lengths)) > 1: # 长度不一致 # 步骤2: 找到最长的序列长度 max_len = max(lengths) # 步骤3: 填充所有序列到相同长度 (用0填充) padded_data = [np.pad(item if isinstance(item, (list, np.ndarray)) else [item], (0, max_len - len(item)), mode='constant', constant_values=0) for item in data] # 步骤4: 转换为NumPy数组 return np.array(padded_data) # 所有长度相同，直接转换 return np.array(data) # 示例数据 (包含不同形状的序列) raw_data = [ [1.2, 0.5, 3.7], [4.1, 2.3], [0.8], [5.6, 4.2, 1.9, 0.4] ] try: # 1. 安全转换数据 X = safe_array_conversion(raw_data) # 检查转换结果 print("转换后数组形状:", X.shape) print(X) # 2. 标准化处理 scaler = StandardScaler() X_scaled = scaler.fit_transform(X) print("\n标准化后数据:") print(X_scaled) except Exception as e: print(f"处理失败: {str(e)}") ``` ### 关键修复点 1. **修复数组形状问题**： ```python def safe_array_conversion(data): # 检测序列长度差异 lengths = [len(item) if isinstance(item, (list, np.ndarray)) else 1 for item in data] if len(set(lengths)) > 1: # 统一长度 (填充短序列) max_len = max(lengths) padded_data = [np.pad(item, (0, max_len - len(item)), constant_values=0) for item in data] return np.array(padded_data) return np.array(data) ``` 2. **避免警告和错误**： - 确保所有输入样本具有相同的特征维度 - 用0填充较短的序列 ($\forall i, \dim(\vec{x}_i) = \max_j \dim(\vec{x}_j)$) - 使用`np.pad`进行标准化填充操作 ### 运行结果示例： ``` 转换后数组形状: (4, 4) [[1.2 0.5 3.7 0. ] [4.1 2.3 0. 0. ] [0.8 0. 0. 0. ] [5.6 4.2 1.9 0.4]] 标准化后数据: [[-0.65 -1.01 1.33 -0.5 ] [ 0.97 -0.11 -0.63 -0.5 ] [-1.21 -1.01 -0.63 -0.5 ] [ 0.89 2.13 -0.07 1.5 ]] ``` ### 替代方案（使用sklearn的填充转换器） ```python from sklearn.impute import SimpleImputer from sklearn.pipeline import make_pipeline # 创建数据处理管道 pipeline = make_pipeline( SimpleImputer(strategy='constant', fill_value=0), StandardScaler() ) # 处理不同长度序列 X_transformed = pipeline.fit_transform(raw_data) ```

阅读全文

CSDN会员

开通CSDN年卡参与万元壕礼抽奖

海量 VIP免费资源千本正版电子书商城会员专享价千门课程&专栏

全年可省5,000元立即开通

大家在看

apabi maker home 64bit

2020_0610_应对新兴毫米波应用的测试挑战.pdf

专杀工具Zbot或Zeus专杀.zip

合工大学 计算机专业 保研经验整理和真题回忆版.zip

ScreenControl_717_M59_20191107_windows_program_

最新推荐

开发界面语义化：声控 + 画图协同生成代码.doc

Python程序TXLWizard生成TXL文件及转换工具介绍

【创新图生成：扣子平台的技术前沿与创新思维】：引领图像生成技术的新潮流

海康威视机器视觉工程师考核

Linux环境下Docker Hub公共容器映像检测工具集

【扣子平台图像艺术探究：理论与实践的完美结合】：深入学习图像生成的艺术

增广路定理的证明

Pulse：基于SwiftUI的Apple平台高效日志记录与网络监控

【深入扣子平台：图像生成机制全揭秘】：掌握背后技术，提升图像生成效率

对RTL跑regression是什么意思

合工大学计算机专业保研经验整理和真题回忆版.zip