RuntimeError: Dataset scripts are no longer supported, but found wikipedia.py

运行如下代码时出现错误

from datasets import load_dataset
ds_name="wikipedia"
raw_ds = load_dataset(
        ds_name,
    dict(wikitext="wikitext-103-raw-v1", wikipedia="20220301.en")[ds_name]
    )

错误:

RuntimeError: Dataset scripts are no longer supported, but found wikipedia.py

原因就是datasets包太新了,应该是2025年datasets-4.0.0会出现这个问题。

解决办法就是将包的版本降级。下面这个可以,但是需要在load_dataset函数中加上参数trust_remote_code=True

pip install datasets==3.6.0

下面这个可以直接运行

pip install datasets==2.18.0
"C:\Program Files (x86)\Microsoft Visual Studio\Shared\Python39_64\python.exe" C:\Users\Administrator\PycharmProjects\pythonProject2\2.py Traceback (most recent call last): File "C:\Program Files (x86)\Microsoft Visual Studio\Shared\Python39_64\lib\site-packages\tensorflow\python\pywrap_tensorflow.py", line 73, in <module> from tensorflow.python._pywrap_tensorflow_internal import * ImportError: DLL load failed while importing _pywrap_tensorflow_internal: 动态链接库(DLL)初始化例程失败。 During handling of the above exception, another exception occurred: Traceback (most recent call last): File "C:\Users\Administrator\PycharmProjects\pythonProject2\2.py", line 2, in <module> from keras.models import Sequential File "C:\Program Files (x86)\Microsoft Visual Studio\Shared\Python39_64\lib\site-packages\keras\__init__.py", line 7, in <module> from keras import _tf_keras as _tf_keras File "C:\Program Files (x86)\Microsoft Visual Studio\Shared\Python39_64\lib\site-packages\keras\_tf_keras\__init__.py", line 1, in <module> from keras._tf_keras import keras File "C:\Program Files (x86)\Microsoft Visual Studio\Shared\Python39_64\lib\site-packages\keras\_tf_keras\keras\__init__.py", line 7, in <module> from keras import activations as activations File "C:\Program Files (x86)\Microsoft Visual Studio\Shared\Python39_64\lib\site-packages\keras\activations\__init__.py", line 7, in <module> from keras.src.activations import deserialize as deserialize File "C:\Program Files (x86)\Microsoft Visual Studio\Shared\Python39_64\lib\site-packages\keras\src\__init__.py", line 1, in <module> from keras.src import activations File "C:\Program Files (x86)\Microsoft Visual Studio\Shared\Python39_64\lib\site-packages\keras\src\activations\__init__.py", line 3, in <module> from keras.src.activations.activations import celu File "C:\Program Files (x86)\Microsoft Visual Studio\Shared\Python39_64\lib\site-packages\keras\src\activations\activations.py", line 1, in <module> from keras.src import backend File "C:\Program Files (x86)\Microsoft Visual Studio\Shared\Python39_64\lib\site-packages\keras\src\backend\__init__.py", line 10, in <module> from keras.src.backend.common.dtypes import result_type File "C:\Program Files (x86)\Microsoft Visual Studio\Shared\Python39_64\lib\site-packages\keras\src\backend\common\__init__.py", line 2, in <module> from keras.src.backend.common.dtypes import result_type File "C:\Program Files (x86)\Microsoft Visual Studio\Shared\Python39_64\lib\site-packages\keras\src\backend\common\dtypes.py", line 5, in <module> from keras.src.backend.common.variables import standardize_dtype File "C:\Program Files (x86)\Microsoft Visual Studio\Shared\Python39_64\lib\site-packages\keras\src\backend\common\variables.py", line 11, in <module> from keras.src.utils.module_utils import tensorflow as tf File "C:\Program Files (x86)\Microsoft Visual Studio\Shared\Python39_64\lib\site-packages\keras\src\utils\__init__.py", line 1, in <module> from keras.src.utils.audio_dataset_utils import audio_dataset_from_directory File "C:\Program Files (x86)\Microsoft Visual Studio\Shared\Python39_64\lib\site-packages\keras\src\utils\audio_dataset_utils.py", line 4, in <module> from keras.src.utils import dataset_utils File "C:\Program Files (x86)\Microsoft Visual Studio\Shared\Python39_64\lib\site-packages\keras\src\utils\dataset_utils.py", line 9, in <module> from keras.src import tree File "C:\Program Files (x86)\Microsoft Visual Studio\Shared\Python39_64\lib\site-packages\keras\src\tree\__init__.py", line 1, in <module> from keras.src.tree.tree_api import assert_same_paths File "C:\Program Files (x86)\Microsoft Visual Studio\Shared\Python39_64\lib\site-packages\keras\src\tree\tree_api.py", line 8, in <module> from keras.src.tree import optree_impl as tree_impl File "C:\Program Files (x86)\Microsoft Visual Studio\Shared\Python39_64\lib\site-packages\keras\src\tree\optree_impl.py", line 13, in <module> from tensorflow.python.trackable.data_structures import ListWrapper File "C:\Program Files (x86)\Microsoft Visual Studio\Shared\Python39_64\lib\site-packages\tensorflow\__init__.py", line 40, in <module> from tensorflow.python import pywrap_tensorflow as _pywrap_tensorflow # pylint: disable=unused-import File "C:\Program Files (x86)\Microsoft Visual Studio\Shared\Python39_64\lib\site-packages\tensorflow\python\pywrap_tensorflow.py", line 88, in <module> raise ImportError( ImportError: Traceback (most recent call last): File "C:\Program Files (x86)\Microsoft Visual Studio\Shared\Python39_64\lib\site-packages\tensorflow\python\pywrap_tensorflow.py", line 73, in <module> from tensorflow.python._pywrap_tensorflow_internal import * ImportError: DLL load failed while importing _pywrap_tensorflow_internal: 动态链接库(DLL)初始化例程失败。 Failed to load the native TensorFlow runtime. See https://www.tensorflow.org/install/errors for some common causes and solutions. If you need help, create an issue at https://github.com/tensorflow/tensorflow/issues and include the entire stack trace above this error message. Process finished with exit code 1
06-15
Traceback (most recent call last): File "Y:\Anaconda3\Lib\site-packages\tensorflow\python\pywrap_tensorflow.py", line 73, in <module> from tensorflow.python._pywrap_tensorflow_internal import * ImportError: DLL load failed while importing _pywrap_tensorflow_internal: 动态链接库(DLL)初始化例程失败。 During handling of the above exception, another exception occurred: Traceback (most recent call last): File "E:\PYCharm\数据\ONNX 转换为 TFLite.py", line 1, in <module> from onnx_tf.backend import prepare File "Y:\Anaconda3\Lib\site-packages\onnx_tf\__init__.py", line 1, in <module> from . import backend File "Y:\Anaconda3\Lib\site-packages\onnx_tf\backend.py", line 21, in <module> import tensorflow as tf File "Y:\Anaconda3\Lib\site-packages\tensorflow\__init__.py", line 40, in <module> from tensorflow.python import pywrap_tensorflow as _pywrap_tensorflow # pylint: disable=unused-import ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "Y:\Anaconda3\Lib\site-packages\tensorflow\python\pywrap_tensorflow.py", line 88, in <module> raise ImportError( ImportError: Traceback (most recent call last): File "Y:\Anaconda3\Lib\site-packages\tensorflow\python\pywrap_tensorflow.py", line 73, in <module> from tensorflow.python._pywrap_tensorflow_internal import * ImportError: DLL load failed while importing _pywrap_tensorflow_internal: 动态链接库(DLL)初始化例程失败。 Failed to load the native TensorFlow runtime. See https://www.tensorflow.org/install/errors for some common causes and solutions. If you need help, create an issue at https://github.com/tensorflow/tensorflow/issues and include the entire stack trace above this error message.
07-17
### 关于 `RuntimeError: Dataset 'data.yaml' error` 的分析 当遇到 `RuntimeError: Dataset 'data.yaml' error` 类型的错误时,通常是因为配置文件 `data.yaml` 中定义的数据集路径存在问题。以下是可能的原因及其解决方案: #### 原因一:路径不存在 如果指定的路径无法找到,则会出现此错误。例如,在引用中提到的情况[^1],可能是由于相对路径未正确解析。 - **解决方法** 将 `data.yaml` 文件中的路径更改为绝对路径。例如,假设项目结构如下: ```plaintext project/ ├── dataset/ │ ├── images/ │ └── labels/ └── mytrain.yaml ``` 可以修改 `mytrain.yaml` 如下所示[^3]: ```yaml path: /absolute/path/to/work/projectdemo/MTdataset train: /absolute/path/to/work/projectdemo/MTdataset/train/images val: /absolute/path/to/work/projectdemo/MTdataset/valid/images names: 0: designError 1: hole 2: linerError ``` #### 原因二:YAML 格式错误 YAML 文件对格式非常敏感,尤其是缩进和冒号后的空格。如果没有遵循 YAML 的语法规则,可能会引发类似的错误[^5]。 - **解决方法** 确保每行的关键字后都有一个空格。例如: ```yaml path: /absolute/path/to/dataset train: /absolute/path/to/dataset/train/images val: /absolute/path/to/dataset/val/images test: /absolute/path/to/dataset/test/images nc: 2 names: - dog - cat ``` #### 原因三:映射值语法问题 某些情况下,YAML 解析器可能因为非法字符或不支持的操作而抛出异常。例如,字符串对象不允许赋值操作。 - **解决方法** 如果报错提示类似于 `mapping values are not allowed` 或 `'str' object does not support item assignment`,可以尝试重新编辑 YAML 文件并保存,确保没有多余的特殊字符或隐藏符号。 #### 示例代码 以下是一个完整的 `data.yaml` 配置示例,适用于 YOLOv8 训练任务: ```yaml path: /home/user/yolo_dataset train: /home/user/yolo_dataset/images/train val: /home/user/yolo_dataset/images/val test: /home/user/yolo_dataset/images/test nc: 2 names: - class_1 - class_2 ``` --- ### 总结 通过上述调整,能够有效解决由路径缺失、格式错误或其他技术细节引起的 `RuntimeError: Dataset 'data.yaml' error` 报错。务必确认路径存在且为绝对路径,并严格遵守 YAML 文件的书写规范。
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包

打赏作者

音程

你的鼓励将是我创作的最大动力

¥1 ¥2 ¥4 ¥6 ¥10 ¥20
扫码支付:¥1
获取中
扫码支付

您的余额不足,请更换扫码支付或充值

打赏作者

实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值