问题描述
今天遇到这样一个问题:RuntimeError: No CUDA GPUs are available
ERROR:torch.distributed.elastic.multiprocessing.api:failed (exitcode: 1) local_rank: 0 (pid: 466774) of binary: /home/visionx/anaconda3/envs/globetrotter/bin/python
完整描述是:
/home/visionx/anaconda3/envs/globetrotter/lib/python3.8/site-packages/transformers/deepspeed.py:23: FutureWarning: transformers.deepspeed module is deprecated and will be removed in a future version. Please import deepspeed modules directly from transformers.integrations
warnings.warn(
No CUDA runtime is found, using CUDA_HOME=':/usr/local/cuda'
2024-04-19 13:10:41 | INFO | fairseq.tasks.text_to_speech | Please install tensorboardX: pip install tensorboardX
Namespace(alpha_xm=False, augment_image=True, batch_size=128, checkpoint_dir='checkpoints', checkpoint_path='checkpoints/train_sigurdsson', checkpoint_path_load_from='checkpoints/train_sigurdsson', config_arch='config', config_data='all-lang_test-zh-en', dataset_info_path='dataset_info', dataset_path='dataset', debug=False, evaluate=False, fp16=True, image_size=224, lambda_lm_loss=0.0, lambda_orthogonality_loss=1.0, lambda_visual_loss=0.0, lambda_xlang_loss=0.0, lambda_xm_loss=1.0, language_split='training', learning_rate=0.001, local_rank=0, max_txt_seq_len=50, momentum_bn=0.1, name='train_sigurdsson', not_use_images=False, num_epochs=100, opt_level='O1', output_attentions=False, p_clobbe