{"id":469,"date":"2023-01-04T12:36:16","date_gmt":"2023-01-04T18:36:16","guid":{"rendered":"https:\/\/janajm.com?p=469"},"modified":"2023-01-04T13:10:57","modified_gmt":"2023-01-04T19:10:57","slug":"chatgpt-context-is-king","status":"publish","type":"post","link":"https:\/\/janajm.com\/chatgpt-context-is-king\/","title":{"rendered":"Context is king: A prediction about the future of ChatGPT and other AI chatbots."},"content":{"rendered":"<p>On November 30, 2022, <a href=\"https:\/\/openai.com\">OpenAI<\/a> launched <a href=\"https:\/\/chat.openai.com\">ChatGPT<\/a>, or what is arguably the most advanced AI chatbot that has ever been made available to the public.<sup id=\"fnref:1\"><a href=\"#fn:1\" rel=\"footnote\">1<\/a><\/sup>\u00a0It hit 1 million users just 4 days later.<sup id=\"fnref:2\"><a href=\"#fn:2\" rel=\"footnote\">2<\/a><\/sup><\/p>\n<p>The days following its launch were marked by a wave of screenshots across social media showcasing ChatGPT\u2019s capabilities.<\/p>\n<div class='content-column one_half'><div style=\"padding-right:10px;\"><a href=\"https:\/\/www.reddit.com\/r\/ChatGPT\/comments\/zitw2v\/invent_a_new_type_of_color_and_describe_what_it\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-489\" src=\"https:\/\/janajm.com\/file\/1.png\" alt=\"\" width=\"720\" height=\"878\" srcset=\"https:\/\/janajm.com\/file\/1.png 720w, https:\/\/janajm.com\/file\/1-246x300.png 246w\" sizes=\"auto, (max-width: 720px) 100vw, 720px\" \/><\/a><\/div><\/div>\n<div class='content-column one_half last_column'><a href=\"https:\/\/twitter.com\/levie\/status\/1600393857933246464\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-large wp-image-492\" src=\"https:\/\/janajm.com\/file\/4-1024x938.jpeg\" alt=\"\" width=\"720\" height=\"660\" srcset=\"https:\/\/janajm.com\/file\/4-1024x938.jpeg 1024w, https:\/\/janajm.com\/file\/4-300x275.jpeg 300w, https:\/\/janajm.com\/file\/4-768x704.jpeg 768w, https:\/\/janajm.com\/file\/4-1536x1407.jpeg 1536w, https:\/\/janajm.com\/file\/4.jpeg 1600w\" sizes=\"auto, (max-width: 720px) 100vw, 720px\" \/><\/a><\/div><div class='clear_column'><\/div>\n<div class='content-column one_half'><div style=\"padding-right:10px;\"><a href=\"https:\/\/www.reddit.com\/r\/ChatGPT\/comments\/zkeg2d\/this_is_genius\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-491\" src=\"https:\/\/janajm.com\/file\/3.jpg\" alt=\"\" width=\"796\" height=\"497\" srcset=\"https:\/\/janajm.com\/file\/3.jpg 796w, https:\/\/janajm.com\/file\/3-300x187.jpg 300w, https:\/\/janajm.com\/file\/3-768x480.jpg 768w\" sizes=\"auto, (max-width: 796px) 100vw, 796px\" \/><\/a><\/div><\/div>\n<div class='content-column one_half last_column'><a href=\"https:\/\/www.reddit.com\/r\/ChatGPT\/comments\/zjs9sg\/this_is_insane\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-490\" src=\"https:\/\/janajm.com\/file\/2.png\" alt=\"\" width=\"705\" height=\"588\" srcset=\"https:\/\/janajm.com\/file\/2.png 705w, https:\/\/janajm.com\/file\/2-300x250.png 300w\" sizes=\"auto, (max-width: 705px) 100vw, 705px\" \/><\/a><\/div><div class='clear_column'><\/div>\n<p>It is, undeniably, an incredible technology with any number of <a href=\"https:\/\/medium.com\/@markwschaefer\/20-entertaining-uses-of-chatgpt-you-never-knew-were-possible-3bc2644d4507\">use cases<\/a>. Yet so is a lot of technology in the AI space. What made ChatGPT strike a chord in ways that similar technologies haven\u2019t was likely some combination of the following 3 factors:<\/p>\n<ol>\n<li>Unlike much of the latest work in AI, ChatGPT is freely<sup id=\"fnref:3\"><a href=\"#fn:3\" rel=\"footnote\">3<\/a><\/sup> available to the public.<\/li>\n<li>Unlike other similar technologies, ChatGPT is available to the public in a way that\u2019s simultaneously easy-to-implement and, once implemented, user-friendly. Accessing it doesn\u2019t require users to download any software, navigate a GitHub repository, or reach out to the authors of a study for access to their code, and using ChatGPT doesn&#8217;t require any expert knowledge or specialized skillsets.<\/li>\n<li>ChatGPT benefits from at least one aspect of what economists have termed \u2018<a href=\"https:\/\/www.investopedia.com\/terms\/n\/network-effect.asp\">the network effect<\/a>,\u2019 which describes the phenomenon whereby a service becomes better or more valuable as the number of people using that service increases. In the case of ChatGPT, this took shape in the form of users learning how best to use the technology from other users sharing how <i>they<\/i> had used the technology, which in turn encouraged more users to use it (and on and on and on).<\/li>\n<\/ol>\n<p>Things move quickly in this space, though, and there are already those who have begun looking beyond ChatGPT&#8217;s current form in anticipation of what future iterations of the technology might be able to offer. There are <a href=\"https:\/\/www.wired.com\/story\/cerebras-chip-cluster-neural-networks-ai\/\">rumours<\/a>, for example, that GPT-4 \u2014 the successor to GPT-3, of which ChatGPT is a variant \u2014 will feature 100 trillion parameters, but <a href=\"https:\/\/twitter.com\/Mascobot\/status\/1595077295144046593\">others<\/a> have <a href=\"https:\/\/thealgorithmicbridge.substack.com\/p\/gpt-4-rumors-from-silicon-valley\">challenged<\/a> this claim.<sup id=\"fnref:4\"><a href=\"#fn:4\" rel=\"footnote\">4<\/a><\/sup><\/p>\n<p>Regardless of their exact specs, the most common improvements \u2014 whether to ChatGPT or to similar technologies in the future \u2014 are likely to be increases in what might be termed \u2018reference material.\u2019 Future iterations of AI chatbots will, for example, be able to discuss a wider range of material, to cover that material at greater depth, and to feature more recent material than what is presently available.<\/p>\n<p>I predict that while these kinds of improvements will undoubtedly be useful, they won\u2019t be the breakthrough innovation that leads to mainstream adoption of ChatGPT or to major leaps in user benefit. Rather, the breakthrough innovation for AI chatbots will be an increase in what I\u2019m calling \u2018<strong>contextual capability<\/strong>.\u2019\u00a0Conceptually, contextual capability represents a complex set of processes which could be expressed in many different ways. For the sake of simplicity, however, I\u2019ll break this concept down to just three primary &#8216;levels.&#8217;<\/p>\n<p><strong>The first level of contextual capability<\/strong> is invisible to the user, as it represents things like the amount of training data and the number of parameters that an AI chatbot was developed on and can therefore utilize in preparing its responses. ChatGPT performs exceptionally well on this level. For now, perhaps its primary shortcoming on this level is the fact that, at the time of this writing, its training data only goes up to <a href=\"https:\/\/help.openai.com\/en\/articles\/6639781-do-the-openai-api-models-have-knowledge-of-current-events\">June of 2021<\/a><sup>\u2060<\/sup>. As a result, it can\u2019t refer to, describe, or otherwise engage with events or developments that took place beyond that point in time.<\/p>\n<p>Large models are not particularly impressive, novel, or distinct simply by virtue of their size, however.<\/p>\n<p><a href=\"https:\/\/twitter.com\/vboykis\/status\/1560297519879536644\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-500\" src=\"https:\/\/janajm.com\/file\/5-1024x602.png\" alt=\"\" width=\"500\" height=\"294\" srcset=\"https:\/\/janajm.com\/file\/5-1024x602.png 1024w, https:\/\/janajm.com\/file\/5-300x176.png 300w, https:\/\/janajm.com\/file\/5-768x451.png 768w, https:\/\/janajm.com\/file\/5.png 1191w\" sizes=\"auto, (max-width: 500px) 100vw, 500px\" \/><\/a><\/p>\n<p>Rather, it\u2019s with the second level of contextual capability that things really start to get interesting. <strong>The second level of contextual capability<\/strong> represents the ability of an AI chatbot to take in a certain level of context from a prompt and produce a response that takes into account all of that context. Or, in the words of ChatGPT:<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-large wp-image-502\" src=\"https:\/\/janajm.com\/file\/6-1024x724.png\" alt=\"\" width=\"720\" height=\"509\" srcset=\"https:\/\/janajm.com\/file\/6-1024x724.png 1024w, https:\/\/janajm.com\/file\/6-300x212.png 300w, https:\/\/janajm.com\/file\/6-768x543.png 768w, https:\/\/janajm.com\/file\/6-1536x1086.png 1536w, https:\/\/janajm.com\/file\/6.png 1720w\" sizes=\"auto, (max-width: 720px) 100vw, 720px\" \/><\/p>\n<p>If you&#8217;ve been following along with ChatGPT since its launch, you might have noticed that many of the most compelling examples of ChatGPT&#8217;s responses have been the result of the most context-heavy prompts. We can demonstrate this by comparing ChatGPT\u2019s responses to the following prompts.<\/p>\n<p>1.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-large wp-image-504\" src=\"https:\/\/janajm.com\/file\/7-1024x677.png\" alt=\"\" width=\"720\" height=\"476\" srcset=\"https:\/\/janajm.com\/file\/7-1024x677.png 1024w, https:\/\/janajm.com\/file\/7-300x198.png 300w, https:\/\/janajm.com\/file\/7-768x507.png 768w, https:\/\/janajm.com\/file\/7-1536x1015.png 1536w, https:\/\/janajm.com\/file\/7.png 1786w\" sizes=\"auto, (max-width: 720px) 100vw, 720px\" \/><br \/>\n<img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-large wp-image-505\" src=\"https:\/\/janajm.com\/file\/8-1024x584.png\" alt=\"\" width=\"720\" height=\"411\" srcset=\"https:\/\/janajm.com\/file\/8-1024x584.png 1024w, https:\/\/janajm.com\/file\/8-300x171.png 300w, https:\/\/janajm.com\/file\/8-768x438.png 768w, https:\/\/janajm.com\/file\/8-1536x876.png 1536w, https:\/\/janajm.com\/file\/8.png 1782w\" sizes=\"auto, (max-width: 720px) 100vw, 720px\" \/><\/p>\n<p>2.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-large wp-image-509\" src=\"https:\/\/janajm.com\/file\/9-1024x700.png\" alt=\"\" width=\"720\" height=\"492\" srcset=\"https:\/\/janajm.com\/file\/9-1024x700.png 1024w, https:\/\/janajm.com\/file\/9-300x205.png 300w, https:\/\/janajm.com\/file\/9-768x525.png 768w, https:\/\/janajm.com\/file\/9-1536x1050.png 1536w, https:\/\/janajm.com\/file\/9.png 1726w\" sizes=\"auto, (max-width: 720px) 100vw, 720px\" \/><br \/>\n<img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-large wp-image-510\" src=\"https:\/\/janajm.com\/file\/10-1024x608.png\" alt=\"\" width=\"720\" height=\"428\" srcset=\"https:\/\/janajm.com\/file\/10-1024x608.png 1024w, https:\/\/janajm.com\/file\/10-300x178.png 300w, https:\/\/janajm.com\/file\/10-768x456.png 768w, https:\/\/janajm.com\/file\/10-1536x912.png 1536w, https:\/\/janajm.com\/file\/10.png 1718w\" sizes=\"auto, (max-width: 720px) 100vw, 720px\" \/><\/p>\n<p>3.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-large wp-image-512\" src=\"https:\/\/janajm.com\/file\/11-1024x738.png\" alt=\"\" width=\"720\" height=\"519\" srcset=\"https:\/\/janajm.com\/file\/11-1024x738.png 1024w, https:\/\/janajm.com\/file\/11-300x216.png 300w, https:\/\/janajm.com\/file\/11-768x553.png 768w, https:\/\/janajm.com\/file\/11-1536x1106.png 1536w, https:\/\/janajm.com\/file\/11.png 1752w\" sizes=\"auto, (max-width: 720px) 100vw, 720px\" \/><br \/>\n<img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-large wp-image-513\" src=\"https:\/\/janajm.com\/file\/12-1024x610.png\" alt=\"\" width=\"720\" height=\"429\" srcset=\"https:\/\/janajm.com\/file\/12-1024x610.png 1024w, https:\/\/janajm.com\/file\/12-300x179.png 300w, https:\/\/janajm.com\/file\/12-768x457.png 768w, https:\/\/janajm.com\/file\/12-1536x915.png 1536w, https:\/\/janajm.com\/file\/12.png 1708w\" sizes=\"auto, (max-width: 720px) 100vw, 720px\" \/><\/p>\n<p>Here\u2019s another example.<\/p>\n<p>1.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-large wp-image-515\" src=\"https:\/\/janajm.com\/file\/13-1024x692.png\" alt=\"\" width=\"720\" height=\"487\" srcset=\"https:\/\/janajm.com\/file\/13-1024x692.png 1024w, https:\/\/janajm.com\/file\/13-300x203.png 300w, https:\/\/janajm.com\/file\/13-768x519.png 768w, https:\/\/janajm.com\/file\/13-1536x1038.png 1536w, https:\/\/janajm.com\/file\/13.png 1740w\" sizes=\"auto, (max-width: 720px) 100vw, 720px\" \/><\/p>\n<p>2.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-large wp-image-516\" src=\"https:\/\/janajm.com\/file\/14-1024x829.png\" alt=\"\" width=\"720\" height=\"583\" srcset=\"https:\/\/janajm.com\/file\/14-1024x829.png 1024w, https:\/\/janajm.com\/file\/14-300x243.png 300w, https:\/\/janajm.com\/file\/14-768x621.png 768w, https:\/\/janajm.com\/file\/14-1536x1243.png 1536w, https:\/\/janajm.com\/file\/14.png 1708w\" sizes=\"auto, (max-width: 720px) 100vw, 720px\" \/><\/p>\n<p>3.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-large wp-image-517\" src=\"https:\/\/janajm.com\/file\/15-1024x792.png\" alt=\"\" width=\"720\" height=\"557\" srcset=\"https:\/\/janajm.com\/file\/15-1024x792.png 1024w, https:\/\/janajm.com\/file\/15-300x232.png 300w, https:\/\/janajm.com\/file\/15-768x594.png 768w, https:\/\/janajm.com\/file\/15-1536x1188.png 1536w, https:\/\/janajm.com\/file\/15.png 1706w\" sizes=\"auto, (max-width: 720px) 100vw, 720px\" \/><br \/>\n<img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-large wp-image-518\" src=\"https:\/\/janajm.com\/file\/16-1024x743.png\" alt=\"\" width=\"720\" height=\"522\" srcset=\"https:\/\/janajm.com\/file\/16-1024x743.png 1024w, https:\/\/janajm.com\/file\/16-300x218.png 300w, https:\/\/janajm.com\/file\/16-768x558.png 768w, https:\/\/janajm.com\/file\/16-1536x1115.png 1536w, https:\/\/janajm.com\/file\/16.png 1744w\" sizes=\"auto, (max-width: 720px) 100vw, 720px\" \/><\/p>\n<p>There are a few issues with ChatGPT\u2019s responses to the above prompts. But, for the moment, let\u2019s focus on how its responses become increasingly more compelling as more and more context is added to the prompt.<\/p>\n<p>This leads us to <strong>the third level of contextual capability<\/strong>, which might be termed \u2018the world-building level.\u2019 This level represents the extent to which ChatGPT is able to take in a user\u2019s contextual variables (as presented to it through prompts), produce a response that takes into account those variables, and then store both the user-provided prompts and its own responses to those prompts for the remainder of the session.<\/p>\n<p>As an example of what this looks like in practice, let\u2019s return to our song about Phineas, the anxious goose. ChatGPT has already provided us with a wonderful response to our initial prompt, but we\u2019d like to remain in the world of Phineas and his goose-related troubles for just a little while longer. Perhaps we\u2019re a songwriter who has been tasked with writing an album about Phineas, and as a result we need more material than just the one song. Let\u2019s see what we can do.<\/p>\n<p>1.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-large wp-image-520\" src=\"https:\/\/janajm.com\/file\/17-1024x826.png\" alt=\"\" width=\"720\" height=\"581\" srcset=\"https:\/\/janajm.com\/file\/17-1024x826.png 1024w, https:\/\/janajm.com\/file\/17-300x242.png 300w, https:\/\/janajm.com\/file\/17-768x619.png 768w, https:\/\/janajm.com\/file\/17-1536x1239.png 1536w, https:\/\/janajm.com\/file\/17.png 1684w\" sizes=\"auto, (max-width: 720px) 100vw, 720px\" \/><br \/>\n<img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-large wp-image-521\" src=\"https:\/\/janajm.com\/file\/18-1024x616.png\" alt=\"\" width=\"720\" height=\"433\" srcset=\"https:\/\/janajm.com\/file\/18-1024x616.png 1024w, https:\/\/janajm.com\/file\/18-300x180.png 300w, https:\/\/janajm.com\/file\/18-768x462.png 768w, https:\/\/janajm.com\/file\/18-1536x924.png 1536w, https:\/\/janajm.com\/file\/18.png 1686w\" sizes=\"auto, (max-width: 720px) 100vw, 720px\" \/><\/p>\n<p>2.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-large wp-image-522\" src=\"https:\/\/janajm.com\/file\/19-1024x735.png\" alt=\"\" width=\"720\" height=\"517\" srcset=\"https:\/\/janajm.com\/file\/19-1024x735.png 1024w, https:\/\/janajm.com\/file\/19-300x215.png 300w, https:\/\/janajm.com\/file\/19-768x551.png 768w, https:\/\/janajm.com\/file\/19-1536x1103.png 1536w, https:\/\/janajm.com\/file\/19.png 1694w\" sizes=\"auto, (max-width: 720px) 100vw, 720px\" \/><br \/>\n<img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-large wp-image-523\" src=\"https:\/\/janajm.com\/file\/20-1024x604.png\" alt=\"\" width=\"720\" height=\"425\" srcset=\"https:\/\/janajm.com\/file\/20-1024x604.png 1024w, https:\/\/janajm.com\/file\/20-300x177.png 300w, https:\/\/janajm.com\/file\/20-768x453.png 768w, https:\/\/janajm.com\/file\/20-1536x907.png 1536w, https:\/\/janajm.com\/file\/20.png 1728w\" sizes=\"auto, (max-width: 720px) 100vw, 720px\" \/><\/p>\n<p>3.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-large wp-image-524\" src=\"https:\/\/janajm.com\/file\/21-1024x253.png\" alt=\"\" width=\"720\" height=\"178\" srcset=\"https:\/\/janajm.com\/file\/21-1024x253.png 1024w, https:\/\/janajm.com\/file\/21-300x74.png 300w, https:\/\/janajm.com\/file\/21-768x190.png 768w, https:\/\/janajm.com\/file\/21-1536x380.png 1536w, https:\/\/janajm.com\/file\/21.png 1724w\" sizes=\"auto, (max-width: 720px) 100vw, 720px\" \/><\/p>\n<p>4.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-large wp-image-525\" src=\"https:\/\/janajm.com\/file\/22-1024x736.png\" alt=\"\" width=\"720\" height=\"518\" srcset=\"https:\/\/janajm.com\/file\/22-1024x736.png 1024w, https:\/\/janajm.com\/file\/22-300x216.png 300w, https:\/\/janajm.com\/file\/22-768x552.png 768w, https:\/\/janajm.com\/file\/22-1536x1103.png 1536w, https:\/\/janajm.com\/file\/22.png 1690w\" sizes=\"auto, (max-width: 720px) 100vw, 720px\" \/><br \/>\n<img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-large wp-image-526\" src=\"https:\/\/janajm.com\/file\/23-1024x600.png\" alt=\"\" width=\"720\" height=\"422\" srcset=\"https:\/\/janajm.com\/file\/23-1024x600.png 1024w, https:\/\/janajm.com\/file\/23-300x176.png 300w, https:\/\/janajm.com\/file\/23-768x450.png 768w, https:\/\/janajm.com\/file\/23-1536x900.png 1536w, https:\/\/janajm.com\/file\/23.png 1734w\" sizes=\"auto, (max-width: 720px) 100vw, 720px\" \/><\/p>\n<p>Truly, it\u2019s remarkable what ChatGPT has been able to produce in response to these follow-up prompts in world-building. And yet, as we can see, ChatGPT hasn\u2019t quite fulfilled the requirements of the third level of contextual capability. Remember, for example, that Phineas and Greta met in the winter, not in the spring. It doesn\u2019t make sense, moreover, that they would be flying south in the spring, as springtime is when they would be returning north.<\/p>\n<p>When we point out these errors to ChatGPT, it\u2019s able to produce a more accurate revision:<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-large wp-image-528\" src=\"https:\/\/janajm.com\/file\/24-1024x786.png\" alt=\"\" width=\"720\" height=\"553\" srcset=\"https:\/\/janajm.com\/file\/24-1024x786.png 1024w, https:\/\/janajm.com\/file\/24-300x230.png 300w, https:\/\/janajm.com\/file\/24-768x590.png 768w, https:\/\/janajm.com\/file\/24-1536x1179.png 1536w, https:\/\/janajm.com\/file\/24.png 1704w\" sizes=\"auto, (max-width: 720px) 100vw, 720px\" \/><br \/>\n<img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-large wp-image-529\" src=\"https:\/\/janajm.com\/file\/25-1024x607.png\" alt=\"\" width=\"720\" height=\"427\" srcset=\"https:\/\/janajm.com\/file\/25-1024x607.png 1024w, https:\/\/janajm.com\/file\/25-300x178.png 300w, https:\/\/janajm.com\/file\/25-768x455.png 768w, https:\/\/janajm.com\/file\/25-1536x910.png 1536w, https:\/\/janajm.com\/file\/25.png 1720w\" sizes=\"auto, (max-width: 720px) 100vw, 720px\" \/><\/p>\n<p>In fulfilling the third level of contextual capability, however, the question is not whether ChatGPT can correct its responses with our feedback. The question is whether ChatGPT can take in our initial contextual variables, produce a response that takes into account those variables, and then store both our variable-containing prompts and its responses to those prompts for the remainder of the session as we add to or otherwise change our initial variables. Already, as we have seen, ChatGPT is able to do this to some extent. But I predict that future iterations of this technology will be significantly more adept at doing so.<\/p>\n<p>In the FAQ for ChatGPT, OpenAI <a href=\"https:\/\/help.openai.com\/en\/articles\/6787051-does-chatgpt-remember-what-happened-earlier-in-the-conversation\">states<\/a> that ChatGPT \u201cis able to remember what the user has said earlier in the conversation \u2026 up to approximately 3000 words (or 4000 tokens)\u201d in the past. \u201cAny information beyond that,\u201d they explain, \u201cis not stored,\u201d and \u201cChatGPT is not able to access past conversations to inform its responses.\u201d This functionality of third-level contextual capability, in other words, has already been built into the model, which indicates that it was seen to be a sufficiently valuable feature by its developers. In a model of this scale and complexity, that certainly wasn\u2019t a given, as it would have required considerable resources of time and expertise to build out. With the above response, moreover, there is the suggestion that those who are working on further improving ChatGPT recognize that its capabilities in this regard are currently somewhat lacking relative to what its users would ideally like for it to be able to do.<\/p>\n<p>The reason why future improvements in contextual capability will be important in establishing ChatGPT as a breakthrough innovation is suggested both by the above question and by OpenAI\u2019s response. Let\u2019s return to them both now, this time in their full extent. See if you can spot it:<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-large wp-image-530\" src=\"https:\/\/janajm.com\/file\/26-1024x750.png\" alt=\"\" width=\"720\" height=\"527\" srcset=\"https:\/\/janajm.com\/file\/26-1024x750.png 1024w, https:\/\/janajm.com\/file\/26-300x220.png 300w, https:\/\/janajm.com\/file\/26-768x562.png 768w, https:\/\/janajm.com\/file\/26-1536x1125.png 1536w, https:\/\/janajm.com\/file\/26.png 1994w\" sizes=\"auto, (max-width: 720px) 100vw, 720px\" \/><\/p>\n<p>It comes down to one key word: <i>remember<\/i>.<\/p>\n<p>The act of remembering is a practice that\u2019s unique to living organisms. Humans, for example, will remember to pick up the dry cleaning on their way home from work. (Or, at least, they\u2019ll try to.) A dog will remember the sound of their name and perk up when it\u2019s called. Even cells \u2014 the smallest unit of an organism \u2014 are <a href=\"https:\/\/www.scientificamerican.com\/article\/can-a-cell-remember\/\">capable<\/a> of remembering. But computers? Computers don\u2019t remember things. Computers store information. With the right prompts, computers are able to retrieve and then present that information to their users.<\/p>\n<p>Why, then, would OpenAI use the term \u2018remember\u2019 to describe what ChatGPT is capable of doing? I suspect that, in part, it\u2019s a gesture towards a future in which ChatGPT has undergone significant improvements in contextual capability.<\/p>\n<p>Think of it this way: you meet a friend for coffee, and you have so much to talk about that you end up staying out with them for hours. At the end of that conversation, your friend will still remember things that you said to them at the beginning of the conversation. They might not remember every single detail, but even weeks or months later they will still remember enough about your conversation to be able to both refer back to and build upon what you shared \u2014 and you\u2019ll be able to do the same for them. That conversation, in other words, will have entered into the world-building of your friendship.<\/p>\n<p>Now, think of the possibilities involved in having even a fraction of that level of contextual capability with an AI chatbot. It won&#8217;t need to remember weeks\u2019 or months\u2019 worth of contextual variables in that way that your friend does. Imagine, for example, if ChatGPT could simply remember the contextual variables from an interaction that spanned the entirety of a single day. My sense is that this functionality is coming, and that it will be here sooner than we might expect.<\/p>\n<p>In the near future, you&#8217;ll boot up ChatGPT at the beginning of your workday, during which time you&#8217;ll initiate your world-building for the day\u2019s tasks with a series of context-heavy prompts that will include any domain-specific or organizational-specific variables it will need to know in order to assist you in those tasks. (e.g., \u201cWith this iteration of the project, we\u2019re optimizing for cost efficiency.\u201d) From there, you&#8217;ll keep interacting with ChatGPT over the course of the day as you work to complete your tasks, and all throughout your exchanges it will remember each update or addition to the initial set of contextual variables you provided.<\/p>\n<p>It will be like working not just with an expert in your field, but with an expert in your specific subfield of your field who also happens to be working the same job \u2014 at the same company, with the same team, and on the same project \u2014 as you.<\/p>\n<hr style=\"width: 50%; height: 2px\">\n<div style=\"font-size: 18px;\">\n<p id=\"fn:1\"><sup>1 <\/sup>The <em>New York Times<\/em> <a href=\"https:\/\/www.nytimes.com\/2022\/12\/05\/technology\/chatgpt-ai-twitter.html\">has called it<\/a> &#8220;the best artificial intelligence chatbot ever released to the general public.&#8221;<a href=\"#fnref:1\" rev=\"footnote\">\u21a9<\/a><\/p>\n<p id=\"fn:2\"><sup>2 <\/sup>It\u2019s worth noting that, in order to gain access to ChatGPT, users have to supply not only an email address but also an active phone number. This, in turn, makes the fact of ChatGPT hitting the 1-million-user mark so quickly all the more remarkable.<a href=\"#fnref:2\" rev=\"footnote\">\u21a9<\/a><\/p>\n<p id=\"fn:3\"><sup>3 <\/sup>This is likely to change in future iterations of ChatGPT, as OpenAI has <a href=\"https:\/\/help.openai.com\/en\/articles\/6783457-chatgpt-faq\">suggested<\/a>.<a href=\"#fnref:3\" rev=\"footnote\">\u21a9<\/a><\/p>\n<p id=\"fn:4\"><sup>4 <\/sup>See also <a href=\"https:\/\/www.alignmentforum.org\/posts\/6Fpvch8RR29qLEWNH\/chinchilla-s-wild-implications\">this post<\/a> on language model scaling.<a href=\"#fnref:4\" rev=\"footnote\">\u21a9<\/a><\/p>\n<p>Featured image: Suzuki Kiitsu&#8217;s <a href=\"https:\/\/www.metmuseum.org\/art\/collection\/search\/48982\">&#8220;Morning Glories&#8221;<\/a> (c. 1800)<\/p>\n<\/div>\n<hr \/>\n<p><strong>Jana M. Perkins<\/strong> is a computational social scientist. An award-winning scholar, her research has been federally funded by the Social Sciences and Humanities Research Council of Canada since 2019. She is the founder of <a href=\"https:\/\/womenofletters.substack.com\"><em>Women of Letters<\/em><\/a>, a longform interview series celebrating women\u2019s paths to professional success. Together with Miranda Dunham-Hickman, she is co-authoring <a href=\"https:\/\/janajm.com\/deep-literacy-digital-time\/\">a book<\/a> that will be published by Routledge.<\/p>\n<p>To learn more about Perkins and her latest work, visit <a href=\"https:\/\/janajm.com\">janajm.com<\/a> or follow her on <a href=\"https:\/\/bsky.app\/profile\/janajm.com\">Bluesky<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>It will be like working not just with an expert in your field, but with an expert in your specific subfield of your field who also happens to be working the same job as you.<\/p>\n","protected":false},"author":1,"featured_media":470,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-469","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-uncategorized"],"aioseo_notices":[],"_links":{"self":[{"href":"https:\/\/janajm.com\/rest\/wp\/v2\/posts\/469","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/janajm.com\/rest\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/janajm.com\/rest\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/janajm.com\/rest\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/janajm.com\/rest\/wp\/v2\/comments?post=469"}],"version-history":[{"count":58,"href":"https:\/\/janajm.com\/rest\/wp\/v2\/posts\/469\/revisions"}],"predecessor-version":[{"id":610,"href":"https:\/\/janajm.com\/rest\/wp\/v2\/posts\/469\/revisions\/610"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/janajm.com\/rest\/wp\/v2\/media\/470"}],"wp:attachment":[{"href":"https:\/\/janajm.com\/rest\/wp\/v2\/media?parent=469"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/janajm.com\/rest\/wp\/v2\/categories?post=469"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/janajm.com\/rest\/wp\/v2\/tags?post=469"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}