Global AI and Data Science

Global AI & Data Science

Train, tune and distribute models with generative AI and machine learning capabilities

 View Only
  • 1.  standard of Large Language Model Naming convention?

    Posted Mon January 22, 2024 11:30 AM

    I saw some LLM models have billions or trillion parameters.  Then it comes to instruction vs chat.   some added version at the end indicates of addition training with additional tokens

    In my opinion the LLMs naming is now similar to the car industry.  Everyone has their models with different names but it can be based on 3-cylinders or more with gas tank capacity , torque, horsepower and so on.   Any thoughts on standardizing how to  name LLMs moving forward? or if there is any scholar documents can show the naming, please let me know



    ------------------------------
    KWOK-BUN Lam
    IBM
    MARKHAM ON
    ------------------------------


  • 2.  RE: standard of Large Language Model Naming convention?

    Posted Tue January 23, 2024 07:18 AM

    I saw these two slides this morning at IBM TechXchange in Barcelona. Maybe the naming here can clarify some of your naming questions. 

    and


    ------------------------------
    Roland Schock
    IBM Champion and IBM Gold Consultant
    ------------------------------



  • 3.  RE: standard of Large Language Model Naming convention?

    Posted Tue January 23, 2024 09:20 AM

    Thanks I also found the following from the "Granite Foundation Models IBM Research, Updated November 30th, 2023"

    I would be just curious if the AI community or Big techs would sync up on the model naming convention and be useful what models can offer

    I guess there is none.   



    ------------------------------
    KWOK-BUN Lam
    IBM
    MARKHAM ON
    ------------------------------