Package Details: python-datasets 3.2.0-1

Git Clone URL: https://aur.archlinux.org/python-datasets.git (read-only)
Package Base: python-datasets
Description: The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Upstream URL: https://github.com/huggingface/datasets
Keywords: datasets deep-learning huggingface machine-learning
Licenses: Apache
Groups: huggingface
Submitter: trougnouf
Maintainer: daskol
Last Packager: daskol
Votes: 4
Popularity: 0.78
First Submitted: 2021-09-01 14:06 (UTC)
Last Updated: 2024-12-11 08:20 (UTC)

Pinned Comments

daskol commented on 2024-09-11 19:59 (UTC)

WARNING: HuggingFace usually needs some time to stabilize a major or even a minor release. Sometimes it takes up to 5-6 patch releases. So be careful when updating python-datasets to the major v3 release.

Latest Comments

pallegro commented on 2023-11-14 02:35 (UTC)

The checksum fails. For 2.14.6 I consistently get a sha256 of 761442be44e2b3c7471851dc241ed51bdab6ad352e4a753443e4c4a579da4d88
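
To confirm a mismatch like this independently of the build tooling, the downloaded source archive can be hashed by hand and compared with the value in the PKGBUILD. The following is a minimal sketch; the helper names `sha256_of` and `matches` are made up for illustration, and the archive path is whatever file your build actually downloaded.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of the file at `path`, read in chunks."""
    digest = hashlib.sha256()
    with Path(path).open("rb") as fh:
        # Read fixed-size chunks so large archives are not loaded into memory at once.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches(path, expected_hex):
    """Compare the file's digest with an expected hex string, ignoring case and whitespace."""
    return sha256_of(path) == expected_hex.strip().lower()
```

Calling `matches(archive_path, value_from_pkgbuild)` returns False when the two disagree, in which case either the upstream archive changed or the PKGBUILD entry is stale.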

daskol commented on 2023-09-06 13:20 (UTC)

@12eason The python-datasets package itself is pure Python, but it depends on several platform-dependent packages such as python-pytorch, python-safetensors, python-sentencepiece, and others.

12eason commented on 2023-09-05 19:35 (UTC)

$ python -c "from datasets import load_dataset; print(load_dataset('squad', split='train')[0])"
[1] 35446 illegal hardware instruction (core dumped) python -c

Does this library have some sort of supersecret dependency on SSE3? It's driving me crazy.
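
An "illegal hardware instruction" crash usually means some compiled dependency uses a CPU instruction the machine lacks. The thread does not establish which instruction set is involved, so the sketch below only lists which of a few candidate flags the CPU does not advertise. It assumes Linux on x86, where `/proc/cpuinfo` has a `flags` line; the function names and the list of flags to check are illustrative guesses. Note that SSE3 appears in that file as `pni`, while `ssse3` is a separate extension.

```python
def cpu_flags(cpuinfo_text):
    """Collect the CPU feature flags listed in /proc/cpuinfo-style text."""
    flags = set()
    for line in cpuinfo_text.splitlines():
        key, sep, value = line.partition(":")
        # Each logical core repeats the "flags" line; the union is enough here.
        if sep and key.strip() == "flags":
            flags.update(value.split())
    return flags


def missing_flags(cpuinfo_text, wanted):
    """Return the wanted flags that the CPU does not advertise, sorted."""
    return sorted(set(wanted) - cpu_flags(cpuinfo_text))


# Example use on a Linux machine (the flag list is an assumption, not a known requirement):
# with open("/proc/cpuinfo") as fh:
#     print(missing_flags(fh.read(), ["pni", "ssse3", "sse4_1", "sse4_2", "avx", "avx2"]))
```

An empty result does not rule out the problem; it only means none of the flags you asked about is missing.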

Relih commented on 2023-02-09 21:51 (UTC)

@daskol I managed to upgrade the package by manually removing the version restrictions. My guess is that the version format of the installed package doesn't match the format in the PKGBUILD, so the dependency can never satisfy the constraint.

daskol commented on 2023-02-08 11:02 (UTC)

@Relih I have no idea how to resolve the dependency conflict. My guess is that python-datasets depends on python-huggingface-hub, and so does python-evaluate, which in turn depends on python-datasets. Maybe this is the issue, but I'm not sure.

Relih commented on 2023-02-07 22:19 (UTC)

@daskol Here you go

Name            : python-huggingface-hub-git
Version         : 0.13.0.dev0-1
Description     : Client library to download and publish models on the huggingface.co hub
Architecture    : any
URL             : https://github.com/huggingface/huggingface_hub
Licenses        : Apache 2.0
Groups          : None
Provides        : python-huggingface-hub
Depends On      : python>=3.6  python-filelock  python-tqdm  python-requests  python-importlib-metadata
Optional Deps   : python-espnet-git: ASR toolkit
                  python-speechbrain-git: ASR toolkit
Required By     : python-datasets  python-evaluate  python-transformers
Optional For    : None
Conflicts With  : python-huggingface-hub
Replaces        : None
Installed Size  : 1757,33 KiB
Packager        : Unknown Packager
Build Date      : Tue 07 Feb 2023 22:17:32 CET
Install Date    : Tue 07 Feb 2023 22:17:41 CET
Install Reason  : Installed as a dependency for another package
Install Script  : No
Validated By    : None

daskol commented on 2023-02-07 22:00 (UTC)

@Relih Sorry, I meant the packages which require python-huggingface-hub.

$ pacman -Qi python-huggingface-hub

How do you install python-datasets? What tool do you use?

Relih commented on 2023-02-07 21:34 (UTC)

@daskol sure can do, here is the output

Name            : python-datasets
Version         : 2.4.0-1
Description     : Library providing one-line dataloaders for many public datasets and efficient data pre-processing
Architecture    : x86_64
URL             : https://github.com/huggingface/datasets
Licenses        : Apache
Groups          : None
Provides        : None
Depends On      : python-pyarrow  python-dill  python-pandas  python-xxhash  python-multiprocess  python-aiohttp  python-huggingface-hub  python-fsspec
Optional Deps   : None
Required By     : python-evaluate
Optional For    : None
Conflicts With  : None
Replaces        : None
Installed Size  : 3,65 MiB
Packager        : Unknown Packager
Build Date      : Sun 07 Aug 2022 18:25:24 CEST
Install Date    : Sun 07 Aug 2022 18:25:39 CEST
Install Reason  : Explicitly installed
Install Script  : No
Validated By    : None
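
When comparing several of these listings, it can help to pull out the `Required By` and `Depends On` fields programmatically. The sketch below parses text shaped like the `pacman -Qi` output above. It assumes English field labels and the `Name : value` layout shown here; `parse_pacman_info` and `package_list` are hypothetical helper names, not part of any pacman tooling.

```python
def parse_pacman_info(text):
    """Parse `pacman -Qi`-style output into a dict of field name -> value string.

    Indented lines (such as wrapped Optional Deps entries) are appended
    to the preceding field, separated by a newline.
    """
    fields = {}
    last = None
    for line in text.splitlines():
        key, sep, value = line.partition(" : ")
        if sep and not line.startswith(" "):
            last = key.strip()
            fields[last] = value.strip()
        elif last is not None and line.strip():
            fields[last] += "\n" + line.strip()
    return fields


def package_list(fields, name):
    """Split a whitespace-separated package field into a list; "None" means empty."""
    value = fields.get(name, "None")
    return [] if value == "None" else value.split()
```

For the listing above, `package_list(parse_pacman_info(text), "Required By")` gives the packages that would need attention before python-datasets can be replaced or upgraded.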

daskol commented on 2023-02-07 21:28 (UTC)

@Relih Thanks! Much better now. It seems that you have some packages which depend on python-huggingface-hub or python-huggingface-hub-git. Can you find the descendants of python-datasets?

pacman -Qi python-datasets