Skip to content
This repository was archived by the owner on Feb 25, 2022. It is now read-only.
This repository was archived by the owner on Feb 25, 2022. It is now read-only.

Argument not a list with same length as devices #266

@monsieurpooh

Description

@monsieurpooh

Last few logs:

Defaulting to GELU activation (see here: https://arxiv.org/abs/1606.08415)
Defaulting to GELU activation (see here: https://arxiv.org/abs/1606.08415)
Defaulting to GELU activation (see here: https://arxiv.org/abs/1606.08415)
prediction_loop marked as finished
Reraising captured error
Traceback (most recent call last):
  File "main.py", line 257, in <module>
    main(args)
  File "main.py", line 184, in main
    handle_pred_output_fn(predictions, logger, enc, params, out_name=f"predictions_{args.sacred_id}_{current_step}")
  File "C:\Max\GPTNeo\inputs.py", line 165, in handle_pred_output
    for i, p in enumerate(predictions):
  File "C:\Users\jerkm\AppData\Local\Programs\Python\Python38\lib\site-packages\tensorflow_estimator\python\estimator\tpu\tpu_estimator.py", line 3153, in predict
    rendezvous.raise_errors()
  File "C:\Users\jerkm\AppData\Local\Programs\Python\Python38\lib\site-packages\tensorflow_estimator\python\estimator\tpu\error_handling.py", line 150, in raise_errors
    six.reraise(typ, value, traceback)
  File "C:\Users\jerkm\AppData\Local\Programs\Python\Python38\lib\site-packages\six.py", line 703, in reraise
    raise value
  File "C:\Users\jerkm\AppData\Local\Programs\Python\Python38\lib\site-packages\tensorflow_estimator\python\estimator\tpu\tpu_estimator.py", line 3142, in predict
    for result in super(TPUEstimator, self).predict(
  File "C:\Users\jerkm\AppData\Local\Programs\Python\Python38\lib\site-packages\tensorflow_estimator\python\estimator\estimator.py", line 612, in predict
    estimator_spec = self._call_model_fn(features, None, ModeKeys.PREDICT,
  File "C:\Users\jerkm\AppData\Local\Programs\Python\Python38\lib\site-packages\tensorflow_estimator\python\estimator\tpu\tpu_estimator.py", line 2941, in _call_model_fn
    return super(TPUEstimator, self)._call_model_fn(features, labels, mode,
  File "C:\Users\jerkm\AppData\Local\Programs\Python\Python38\lib\site-packages\tensorflow_estimator\python\estimator\estimator.py", line 1163, in _call_model_fn
    model_fn_results = self._model_fn(features=features, **kwargs)
  File "C:\Users\jerkm\AppData\Local\Programs\Python\Python38\lib\site-packages\tensorflow_estimator\python\estimator\tpu\tpu_estimator.py", line 3199, in _model_fn
    estimator_spec = model_fn_wrapper.call_without_tpu(
  File "C:\Users\jerkm\AppData\Local\Programs\Python\Python38\lib\site-packages\tensorflow_estimator\python\estimator\tpu\tpu_estimator.py", line 1729, in call_without_tpu
    return self._call_model_fn(features, labels, is_export_mode=is_export_mode)
  File "C:\Users\jerkm\AppData\Local\Programs\Python\Python38\lib\site-packages\tensorflow_estimator\python\estimator\tpu\tpu_estimator.py", line 2072, in _call_model_fn
    estimator_spec = self._model_fn(features=features, **kwargs)
  File "C:\Max\GPTNeo\model_fns.py", line 112, in model_fn
    lowering = mtf.Lowering(graph, {mesh: mesh_impl}, autostack=True)
  File "C:\Users\jerkm\AppData\Local\Programs\Python\Python38\lib\site-packages\mesh_tensorflow\ops.py", line 728, in __init__
    op.lower(self)
  File "C:\Users\jerkm\AppData\Local\Programs\Python\Python38\lib\site-packages\mesh_tensorflow\ops.py", line 4541, in lower
    slices = mesh_impl.allsplit(slices, mesh_axis, tensor_axis)
  File "C:\Users\jerkm\AppData\Local\Programs\Python\Python38\lib\site-packages\mesh_tensorflow\ops.py", line 1099, in allsplit
    which = self.laid_out_pcoord(mesh_axis)
  File "C:\Users\jerkm\AppData\Local\Programs\Python\Python38\lib\site-packages\mesh_tensorflow\ops.py", line 1209, in laid_out_pcoord
    return self.slicewise(my_fn, self.laid_out_pnum())
  File "C:\Users\jerkm\AppData\Local\Programs\Python\Python38\lib\site-packages\mesh_tensorflow\placement_mesh_impl.py", line 173, in slicewise
    ret = mtf.parallel(self.devices, fn, *inputs)
  File "C:\Users\jerkm\AppData\Local\Programs\Python\Python38\lib\site-packages\mesh_tensorflow\ops.py", line 5659, in parallel
    raise ValueError(
ValueError: Argument not a list with same length as devices arg=[0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 246, 247, 248, 249, 250, 251, 252, 253, 254, 255] devices=['device:GPU:0']

Describe the bug
A clear and concise description of what the bug is.

To Reproduce
Steps to reproduce the behavior:

  1. Follow all the steps in the readme and run the command python main.py --predict --prompt prompt.txt --model models/GPT3_2-7B/config.json

Environment (please complete the following information):

  • GPUs: Only using 1 NVIDIA GPU, very standard setup. Windows 10.

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working.

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions