license: apache-2.0

!apt-get install aria2

!aria2c -x 16 -s 16

!./llama-gguf-split --merge Llama-3.1-Nemotron-70B-Instruct-HF-Q8_0-00001-of-00002.gguf Nemotron-70B-Instruct-HF-Q8_0.gguf

!/content/llama.cpp/llama-gguf-split --split-max-size 10G /content/llama.cpp/Nemotron-70B-Instruct-HF-Q8_0.gguf /content/Nemotron-70B-Instruct-HF-Q8
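The merge command above is given only the first shard, so the remaining shards presumably need to sit next to it under matching names. A minimal sketch for checking this before merging, assuming the `<prefix>-NNNNN-of-NNNNN.gguf` naming visible in the shard name above; `shard_names` and `missing_shards` are hypothetical helpers, not part of llama.cpp:

```python
from pathlib import Path


def shard_names(prefix: str, count: int) -> list[str]:
    """Build shard file names in the <prefix>-NNNNN-of-NNNNN.gguf pattern."""
    return [f"{prefix}-{i:05d}-of-{count:05d}.gguf" for i in range(1, count + 1)]


def missing_shards(directory: str, prefix: str, count: int) -> list[str]:
    """Return the expected shard names that are not present in `directory`."""
    base = Path(directory)
    return [name for name in shard_names(prefix, count) if not (base / name).is_file()]
```

For example, `missing_shards(".", "Llama-3.1-Nemotron-70B-Instruct-HF-Q8_0", 2)` returns an empty list only when both parts are in the current directory.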

from huggingface_hub import upload_folder

# Path of the folder to upload

folder_path = "/content/split_model" # replace this with the correct path

repo_id = "sdyy/Nemotron-70B-Instruct-HF-Q8_8parts"

repo_folder_name = "split_model" # replace this with the name you want

token = "xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx"

upload_folder(folder_path=folder_path, repo_id=repo_id, repo_type="model", token=token)

Format: GGUF

Model size: 71B params

Architecture: llama
