Contributions de Jboscher

09:3319 septembre 2023 à 09:33 diff hist +208‎ N AI Agent ‎ Page créée avec « AI agents are artificial entities that sense their environment, make decisions, and take actions. The Rise and Potential of Large Language Model Based Agents: A Survey : https://arxiv.org/pdf/2309.07864.pdf » actuelle

12:101 août 2023 à 12:10 diff hist +709‎ SFT ‎Aucun résumé des modifications actuelle
12:001 août 2023 à 12:00 diff hist +404‎ N SFT ‎ Page créée avec « Supervised Fine-Tuning (SFT): Models are trained on a dataset of instructions and responses. It adjusts the weights in the LLM to minimize the difference between the generated answers and ground-truth responses, acting as labels. == Références == * [https://towardsdatascience.com/fine-tune-your-own-llama-2-model-in-a-colab-notebook-df9823a04a32] Fine-Tune Your Own Llama 2 Model in a Colab Notebook »

09:3427 juillet 2023 à 09:34 diff hist +373‎ N PEFT ‎ Page créée avec « Thus, we, as a com- munity of researchers and engineers, need efficient ways to train on downstream task data. Parameter-efficient fine-tuning, which we denote as PEFT, aims to resolve this problem by only training a small set of parameters which might be a subset of the existing model parameters or a set of newly added parameters. https://arxiv.org/pdf/2303.15647.pdf » actuelle

21:195 juillet 2023 à 21:19 diff hist 0‎ Agent ‎Aucun résumé des modifications actuelle
21:185 juillet 2023 à 21:18 diff hist +1 406‎ N Agent ‎ Page créée avec « In LangChain, agents are high-level components that use language models (LLMs) to determine which actions to take and in what order. An action can either be using a tool and observing its output or returning it to the user. Tools are functions that perform specific duties, such as Google Search, database lookups, or Python REPL. Agents involve an LLM making decisions about which Actions to take, taking that Action, seeing an Observation, and repeating that until... »

10:154 juillet 2023 à 10:15 diff hist +393‎ Temperature ‎Aucun résumé des modifications actuelle
10:144 juillet 2023 à 10:14 diff hist +169‎ Prompt design ‎Aucun résumé des modifications actuelle
10:134 juillet 2023 à 10:13 diff hist +228‎ Zero-shot ‎Aucun résumé des modifications actuelle
10:124 juillet 2023 à 10:12 diff hist +179‎ Prompt design ‎Aucun résumé des modifications
09:204 juillet 2023 à 09:20 diff hist +681‎ N Prompt design ‎ Page créée avec « Please note a few best practices around prompt design. Be concise Be specific and well-defined Ask one task at a time Turn generative tasks into classification tasks. For example, instead of asking what programming language to learn, ask if Python, Java, or C is a better fit for a beginner in programming. and Improve response quality by including examples. Adding instructions and a few examples tends to yield good results however there’s currently no one best w... »
08:584 juillet 2023 à 08:58 diff hist +2 571‎ N Zero-shot ‎ Page créée avec « Zero-shot prompting - is a method where the LLM is given no additional data on the specifictask that it is being asked to perform. Instead, it is only given a prompt that describes the task. For example, if you want the LLM to answer a question, you just prompt "what is prompt design?". One-shot prompting - is a method where the LLM is given a single example of the task that it is being asked to perform. For example, if you want the LLM to write a poem, you might... »

08:3022 juin 2023 à 08:30 diff hist +3 088‎ N Top P ‎ Page créée avec « First, there are different models you can choose from. Each model is tuned to perform well on specific tasks. You can also specify the temperature, top P, and top K. These parameters all adjust the randomness of responses by controlling how the output tokens are selected. When you send a prompt to the model, it produces an array of probabilities over the words that could come next. And from this array, we need some strategy to decide what to return. A simple stra... » actuelle
08:2822 juin 2023 à 08:28 diff hist +2 291‎ N Top K ‎ Page créée avec « First, there are different models you can choose from. Each model is tuned to perform well on specific tasks. You can also specify the temperature, top P, and top K. These parameters all adjust the randomness of responses by controlling how the output tokens are selected. When you send a prompt to the model, it produces an array of probabilities over the words that could come next. And from this array, we need some strategy to decide what to return. A simple stra... » actuelle
08:2422 juin 2023 à 08:24 diff hist +1 620‎ N Temperature ‎ Page créée avec « First, there are different models you can choose from. Each model is tuned to perform well on specific tasks. You can also specify the temperature, top P, and top K. These parameters all adjust the randomness of responses by controlling how the output tokens are selected. When you send a prompt to the model, it produces an array of probabilities over the words that could come next. And from this array, we need some strategy to decide what to return. A simple stra... »

04:0311 mai 2023 à 04:03 diff hist +3‎ LangChain ‎Aucun résumé des modifications actuelle
04:0311 mai 2023 à 04:03 diff hist +481‎ LangChain ‎Aucun résumé des modifications
03:4511 mai 2023 à 03:45 diff hist +240‎ LangChain ‎Aucun résumé des modifications
03:3011 mai 2023 à 03:30 diff hist +211‎ N LangChain ‎ Page créée avec «  == Références == * [https://python.langchain.com/en/latest/index.html] Documentation LangChain * [https://www.youtube.com/watch?v=2xxziIWmaSA&list=PLqZXAkvF1bPNQER9mLmDbntNfSpzdDIU5&index=4] Cours LangChain »

12:432 mai 2023 à 12:43 diff hist +1 569‎ Prompt ‎Aucun résumé des modifications actuelle
09:522 mai 2023 à 09:52 diff hist +182‎ Prompt ‎Aucun résumé des modifications
09:372 mai 2023 à 09:37 diff hist +665‎ Prompt ‎Aucun résumé des modifications
09:252 mai 2023 à 09:25 diff hist +337‎ N LLaMA ‎ Page créée avec « LLaMA (Large Language Model Meta AI) is a language model released by Meta (Facebook). It is Meta’s answer to OpenAI’s GPT models. Like GPT, LLaMA is intended to be a general-purpose foundational model suitable for further fine-tuning. [1] == Références == * [https://agi-sphere.com/llama-models/] A brief history of LLaMA models » actuelle
08:562 mai 2023 à 08:56 diff hist +67‎ Prompt ‎Aucun résumé des modifications

08:021 mai 2023 à 08:02 diff hist +43‎ N Fichier:Cropped-cropped-LogoBackPropTrans-300x120-3.png ‎ Logo BackProp transparent actuelle
07:361 mai 2023 à 07:36 diff hist −1‎ Prompt ‎Aucun résumé des modifications
07:331 mai 2023 à 07:33 diff hist +833‎ Prompt ‎Aucun résumé des modifications
07:291 mai 2023 à 07:29 diff hist +548‎ N Prompt ‎ Page créée avec «  == Références == * [https://learn.microsoft.com/en-us/azure/cognitive-services/openai/concepts/prompt-engineering] Introduction to prompt engineering * [https://www.deeplearning.ai/short-courses/chatgpt-prompt-engineering-for-developers/] ChatGPT Prompt Engineering for Developers * [https://learn.microsoft.com/en-us/azure/cognitive-services/openai/concepts/advanced-prompt-engineering?pivots=programming-language-chat-completions] Prompt engineering techniques... »

16:4027 avril 2023 à 16:40 diff hist +51‎ In-context learning ‎Aucun résumé des modifications actuelle Balise : Révocation manuelle
16:4027 avril 2023 à 16:40 diff hist −51‎ In-context learning ‎Aucun résumé des modifications Balise : Révoqué
16:3827 avril 2023 à 16:38 diff hist +111‎ In-context learning ‎Aucun résumé des modifications
16:3627 avril 2023 à 16:36 diff hist +110‎ N Fichier:Capture d’écran 2023-04-27 à 18.32.50.png ‎ A comparative illustration of in-context learning (ICL) and chain-of-thought (CoT) prompting actuelle
16:3527 avril 2023 à 16:35 diff hist +460‎ N In-context learning ‎ Page créée avec «  [1] The in-context learning (ICL) ability is formally introduced by GPT-3 : assuming that the language model has been provided with a natural language instruction and/or several task demonstrations, it can generate the expected output for the test instances by completing the word sequence of input text, without requiring additional training or gradient update == Références == * [https://arxiv.org/pdf/2303.18223.pdf] A Survey of Large Language Models »
16:2327 avril 2023 à 16:23 diff hist +331‎ Emergent Abilities of Large Language Models ‎Aucun résumé des modifications actuelle
15:5827 avril 2023 à 15:58 diff hist +13‎ Emergent Abilities of Large Language Models ‎Aucun résumé des modifications
15:3927 avril 2023 à 15:39 diff hist +1 239‎ N Emergent Abilities of Large Language Models ‎ Page créée avec « On entend par "Emergent Abilities of Large Language Models" une capacité présente dans un LLM qui ne se retrouve pas dans un modèle similaire mais plus petit. Ce qui veut dire aussi qu'on ne peut pas prévoir (extrapoler) cette nouvelle capacité uniquement à partir de celles d'un modèle plus petit. [1] We consider an ability to be emergent if it is not present in smaller models but is present in larger models. Although scaling is mainly conducted in mode... »
14:5327 avril 2023 à 14:53 diff hist +226‎ Pre-trained language models ‎Aucun résumé des modifications actuelle
14:4427 avril 2023 à 14:44 diff hist +1 080‎ N Pre-trained language models ‎ Page créée avec « Les auteurs de "A Survey of Large Language Models" distinguent les "Pre-trained language models (PLM)" des Large language models (LLM). ELMo et BERT appartiendraient à la 1ère catégorie, un peu comme les pionniers des LLM. As an early attempt, ELMo was proposed to capture context-aware word representations by first pre-training a bidirectional LSTM (biLSTM) network (instead of learning fixed word representations) and then fine-tuning the biLSTM network acco... »
13:3127 avril 2023 à 13:31 diff hist +31‎ N Fichier:Cropped-LogoBackPropTrans-2.png ‎ Logo BackProp actuelle
13:2927 avril 2023 à 13:29 diff hist +39‎ N Fichier:Shamentality artificial intelligence beautiful textures and ter 545b2bf2-aece-4268-b49e-78dee7f78862.png ‎ AI logo by MidJourney actuelle

21:5922 janvier 2023 à 21:59 diff hist +83‎ N Reinforcement Learning from Human Feedback ‎ Page créée avec « Reinforcement Learning from Human Feedback (RLHF) https://huggingface.co/blog/rlhf » actuelle

16:2112 janvier 2023 à 16:21 diff hist +124‎ CLIP ‎Aucun résumé des modifications actuelle

16:2610 janvier 2023 à 16:26 diff hist −9‎ CLIP ‎Aucun résumé des modifications
16:2610 janvier 2023 à 16:26 diff hist −8‎ CLIP ‎Aucun résumé des modifications
16:2110 janvier 2023 à 16:21 diff hist +32‎ CLIP ‎Aucun résumé des modifications Balise : Révocation manuelle
15:5510 janvier 2023 à 15:55 diff hist −7‎ CLIP ‎Aucun résumé des modifications Balise : Révoqué
15:5510 janvier 2023 à 15:55 diff hist −25‎ CLIP ‎Aucun résumé des modifications Balise : Révoqué
15:5410 janvier 2023 à 15:54 diff hist −37‎ CLIP ‎Aucun résumé des modifications
15:5410 janvier 2023 à 15:54 diff hist +38‎ CLIP ‎Aucun résumé des modifications
15:5310 janvier 2023 à 15:53 diff hist −8‎ CLIP ‎Aucun résumé des modifications

Contributions de Jboscher

19 septembre 2023

1 août 2023

27 juillet 2023

5 juillet 2023

4 juillet 2023

22 juin 2023

11 mai 2023

2 mai 2023

1 mai 2023

27 avril 2023

22 janvier 2023

12 janvier 2023

10 janvier 2023

Menu de navigation

Rechercher