We are thrilled to announce the launch of SKT-OMNI-CORPUS-146T-V1, a massive-scale, high-quality dataset designed to power the next generation of foundation models (LLMs) trained from scratch. Developed at SKT AI LABS, this corpus is more than a collection of data; it is a mission to decentralize high-grade AI training for regional languages and global knowledge.
Key Highlights:
• Massive Scale: targeting a multi-terabyte architecture for 146T-level tokenization.
• Pure Quality: curated from 500+ elite sources.
• Structured for MoE: sharded into standardized 3.5GB units (SKT series) for seamless distributed training.
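The fixed-size sharding described above can be sketched as a simple greedy grouping pass. This is an illustrative assumption, not the corpus's actual pipeline: the `plan_shards` function and its naming are hypothetical, and only the ~3.5GB shard size comes from the post.

```python
from typing import Iterable

# ~3.5 GB per shard, as stated in the announcement
SHARD_BYTES = int(3.5 * 1024**3)

def plan_shards(doc_sizes: Iterable[int], shard_bytes: int = SHARD_BYTES) -> list[list[int]]:
    """Greedily group document indices into shards of at most shard_bytes.

    A single document larger than shard_bytes gets its own shard.
    """
    shards: list[list[int]] = []
    current: list[int] = []
    used = 0
    for i, size in enumerate(doc_sizes):
        # start a new shard when the next document would overflow this one
        if current and used + size > shard_bytes:
            shards.append(current)
            current, used = [], 0
        current.append(i)
        used += size
    if current:
        shards.append(current)
    return shards

# Toy example with a 10-byte shard limit:
print(plan_shards([4, 4, 4, 12, 3], shard_bytes=10))  # -> [[0, 1], [2], [3], [4]]
```

Equal-size shards like this keep per-worker I/O balanced when the corpus is streamed across many data-parallel ranks.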
Open for Collaboration!
We are looking for AI researchers, CUDA engineers, and data scientists to join us on this journey of building Project Surya and the ST-X Series models. Whether it's optimization, custom tokenization, or architecture design, let's build the future together.
Alright, so I previously made two Reddit posts in r/quantum and r/quantum_computing for my QPU, QPU-1, but both posts were removed for being "irrelevant" to "academic discussion," so I'm doing it again here on HuggingFace Posts.
I have built a quantum processing unit with one million error-corrected qubits (not a simulator), which you can access here: https://qpu-1.vercel.app
I tried emailing many professors and their students, but NONE responded, so please give me some support.