+
Skip to content
View akanyaani's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.

Block or report akanyaani

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
akanyaani/README.md

👋 Welcome to My GitHub Profile!

I'm Abhay Kumar, Senior LLM Research Engineer at BluOrion, with a deep passion for training neural networks, particularly language models and large language models (LLMs). With over 8 years of experience in Data Science and NLP.

🌟 Highlights

  • Senior LLM Research Engineer at BluOrion
  • Co-Author of Komodo LLM: An open-source, language-specific LLM for Indonesian.
  • 8+ years of experience in Data Science and NLP, with a focus on Language Modeling.

💡 Projects/Papers

  • ZClip: An Adaptive Spike Mitigation for LLM Pre-Training.
  • Variance Control: Variance Control via Weight Rescaling in LLM Pre-training.
  • Komodo LLM: A foundational large language model tailored for a specific language.
  • miniLLama: A straightforward and compact implementation of the LLAMA Model, inspired by Andrej Karpathy's minGPT.
  • GPT2-TF: Implementation of GPT-2 in TensorFlow 2, recognized as the first repository for GPT-2 in TensorFlow 2.

Pinned Loading

  1. gpt-2-tensorflow2.0 gpt-2-tensorflow2.0 Public

    OpenAI GPT2 pre-training and sequence prediction implementation in Tensorflow 2.0

    Python 263 81

  2. miniLLAMA miniLLAMA Public

    A simplified LLAMA implementation for training and inference tasks.

    Python 33 3

  3. ranknet-tensorflow2.0 ranknet-tensorflow2.0 Public

    Implementation of RankNet to LambdaRank in TensorFlow 2.0

    Python 41 7

  4. Illustrated_GPT2_With_Code Illustrated_GPT2_With_Code Public

    Explained GPT-2 Transformer model step by step with code.

    Jupyter Notebook 17 4

  5. minGPTF minGPTF Public

    A TF re-implementation of the Karpathy's minGPT (Generative Pretrained Transformer) training

    Python 7 1

  6. Phrase_Extraction_Bi-LSTM Phrase_Extraction_Bi-LSTM Public

    Phrase Extraction using Bi Directional LSTM

    Python 11

点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载