Podcast - Liquid Network - LLMs for Bitcoin Development with Spirit of Satoshi
01 Apr 2024
Reading time ~2 minutes
Esta apresentação no canal Liquid Network mostra uma fase específica do trabalho no Spirit of Satoshi: sair do modelo educacional geral sobre Bitcoin e criar ferramentas mais técnicas para gente que precisa programar, ler documentação e depurar código no ecossistema Liquid. A abertura é feita pelo Jeff, explicando por que as equipes com quem eles conversavam sentiam falta de um assistente menos genérico do que Copilot ou um chatbot mainstream.
Capítulos por assunto
- 00:00 Por que LLMs genericos falham para Bitcoin dev
- 06:50 Satoshi educacional e o modelo de proof of knowledge
- 14:47 Onboarding tecnico em Liquid e onde doi
- 23:55 Assistente tecnico de Liquid e consulta a documentacao
- 32:59 Code Satoshi e a demo de Miniscript
- 40:59 Treinamento, alinhamento e vies fiduciario
O episódio gira em torno de três peças diferentes:
- o
Satoshi, modelo mais amplo para educação e perguntas gerais sobre Bitcoin; - o assistente técnico de
Liquid, que responde em cima de documentação e mostra links para as fontes; - o
Code Satoshi, especializado emMiniscript, capaz de escrever, corrigir e explicar políticas e scripts.
Na parte que eu apresento, o ponto mais importante não é “olha que demo legal”, mas a tentativa de reduzir o atrito real de desenvolvimento. O Code Satoshi recebe pedidos em linguagem natural, propõe políticas de Miniscript, mostra a estrutura equivalente e até renderiza fluxogramas para deixar explícitos os ramos de gasto. Para um ecossistema em que muita gente ainda entra por documentação dispersa, isso encurta bastante o caminho entre curiosidade e código utilizável.
Outra parte boa da conversa é a discussão sobre alinhamento e treinamento. O Jeff descreve o uso do proof of knowledge, um fluxo em que bitcoiners ajudam a curar e anotar dados recebendo micropagamentos em Lightning. Depois a conversa entra em fine-tuning, continued pre-training, repositórios próprios como a Nakamoto Repository e o problema nada trivial de tentar reduzir o viés fiduciário de modelos de base treinados na internet inteira.
No final eu entro num ponto que continua atual: modelos não “esquecem” de forma limpa só porque você quer, e mexer neles quase sempre é mais experimental do que o marketing de IA faz parecer. Essa parte deixa a página mais útil do que um simples embed porque documenta não só o produto mostrado, mas o tipo de raciocínio técnico e epistemológico por trás dele.
Transcription (experimental)
Fonte: YouTube
- 00:11 SPEAKER_01 Hi, I'm Jeff, uh product manager at Square Satoshi.
- 00:14 SPEAKER_01 Also have Breno here, uh one of our uh... uh leader uh developers uh he'll be assisting with uh presentation portion about the code assistant.
- 00:23 SPEAKER_01 But yes, we've been hard at work um building some large language models, and uh we are a bit bitcoin centric um yeah company now I guess uh a little bit of a background context.
- 00:38 SPEAKER_01 As you know, there's a lot of different models out there.
- 00:41 SPEAKER_01 Uh you know, Google OpenAI Meta, kind of the leading companies in the industry right now, uh, but there are some inherent problems or challenges there.
- 00:49 SPEAKER_01 Um we've seen some very uh uh public facing examples of that recently with uh kind of the catastrophe of Google's recent large language model release.
- 01:00 SPEAKER_01 Uh but we are here to build some models that have a little bit different of a bias in them.
- 01:06 SPEAKER_01 Um some people are concerned with having a model that has any kind of a bias and think that they should be completely neutral, but the reality is that's just not possible to do.
- 01:15 SPEAKER_01 Um every training data set is gonna have a specific bias in it.
- 01:18 SPEAKER_01 So our goal is to present uh large language model that does not have a fiat bias baked in, uh, but instead uh presents more of a Austrian economics uh bitcoin centric uh emphasis to the world.
- 01:32 SPEAKER_01 So but today um we'll give a little bit of a background uh about code Satoshi, why we built it, um, and then also a more generic view of uh first off, like what we were building before we got into that.
- 01:48 SPEAKER_01 So uh first off, um, can you guys see my screen?
- 01:54 SPEAKER_01 We can.
- 01:55 SPEAKER_01 That's great.
- 01:56 SPEAKER_01 Okay, yeah.
- 01:57 SPEAKER_01 So we actually started uh building uh just
- 01:58 SPEAKER_01 And uh building uh just our main Satoshi model, which is just a I guess you could call it generic, but just a uh a chap for learning and education about Bitcoin generally.
- 02:10 SPEAKER_01 Uh we also released uh Satoshi GPT in the GPT store uh that you can go and find today.
- 02:16 SPEAKER_01 And then on our website, uh we have uh an open source uh fine-tuned Lama213b that's powering that today.
- 02:24 SPEAKER_01 So this enables you to just go and ask any question about a bit more Bitcoin generally.
- 02:30 SPEAKER_01 Uh really helpful for sending your normie friends to it uh who ask you questions like you probably if you've been in Bitcoin a while, been used to answering the same questions over and over again about Bitcoin, uh, which is fine.
- 02:42 SPEAKER_01 Like orange pulling people is is fun and exciting, but it's uh hard to scale.
- 02:47 SPEAKER_01 And so this is a really good tool that we can share with the world to help educate people about Bitcoin, um, and doesn't tell them to you know diversify and buy Ethereum and stuff like that.
- 02:59 SPEAKER_01 So uh Satoshi GPT, we also gave it some really cool capabilities, like you can ask it all types of questions about the Bitcoin price, uh estimating the halving, finding Bitcoin merchants, summarizing the news, building charts, uh some really cool stuff that we gave it the ability to do.
- 03:15 SPEAKER_01 Um so yeah, go ahead and hop in uh to our website and you can play around with that.
- 03:20 SPEAKER_01 It's spiritual satoshi.ai.
- 03:22 SPEAKER_01 Um, or in the GPT store, you can find uh Satoshi.
- 03:26 SPEAKER_01 Uh but what we also found in um this process is that and and with any MLM is it requires you to gather a lot of data, but then also to build and structure that into a very clean data set that needs to be used to to train the model and to iterate on.
- 03:45 SPEAKER_01 Um that also requires some instruction fine-tuning and evaluation, like you need to be able to actually assess if what you've done uh achieves a specific outcome, especially because the data sets involved are just so large that it's almost like
- 03:56 SPEAKER_01 Involved are just so large that it's almost impossible to evaluate it with uh human eyes.
- 04:02 SPEAKER_01 You know, getting to tens or hundreds of thousands of data points, you need to have uh an automated way to evaluate that.
- 04:08 SPEAKER_01 Uh, but this also requires uh human feedback as well, um, because any kind of automated evaluation is gonna have some limitations to it.
- 04:18 SPEAKER_01 So what we wanted to do is be able to crowd source uh uh the knowledge of the bitcoin community.
- 04:24 SPEAKER_01 Uh we call it proof of knowledge, and so what we built is uh a lightning enabled crowdsourcing LLM tool.
- 04:29 SPEAKER_01 So it allows Bitcoiners to come in uh to receive uh a data point.
- 04:36 SPEAKER_01 Uh often that's just a question and answer pair, and they can use their knowledge to help us curate and clean and annotate this data so that we can use that in training.
- 04:45 SPEAKER_01 And it's uh it's a really cool opportunity because it means that Bitcoiners all over the world can get involved um and be able to earn stats for the knowledge that they've gained learning about Bitcoin, and in the process help us to build and fine-tune a model that helps share that knowledge with the world.
- 04:59 SPEAKER_01 And Bitcoin micropayments is really what's made that possible.
- 05:02 SPEAKER_01 I mean, there's there's other like AI data sourcing tools out there, but they have the inherent limitations of fiat, which is like cross-border payments and small transaction sizes.
- 05:11 SPEAKER_01 So uh leveraging Bitcoin, we've actually been able to make a tool that improves the knowledge of the Bitcoin model as well.
- 05:20 SPEAKER_01 Um what we did though is we actually started talking to a lot of companies about how they can leverage um um AI to help their company, and what we found uh specifically with the liquid community is liquid community is that there was a need there for developers.
- 05:37 SPEAKER_01 Um, some of the feedback we got with our conversations.
- 05:41 SPEAKER_01 There's just a lot of uh uh it's a complex uh industry, complex technology, um, and it's challenging for a lot of developers to get to get started, especially because uh it's just a newer technology, there's not um you know extensive uh uh uh documentation or knowledge sharing that's out there yet because it's so new.
- 06:01 SPEAKER_01 Um, and so a lot of the companies we talked to just expressed some of the challenges they were having, and so we we found that there's a really good uh opportunity here to help uh developers in the space uh uh get started more quickly, and so that's kind of what started our our desire to build a liquid and uh code assistant tool.
- 06:23 SPEAKER_01 Um, so yeah, it's uh very useful.
- 06:27 SPEAKER_01 Um it's been proven out there with tools like Copilot, which has been uh adopted heavily by developers, that these can be effective tools for saving time, and saving time is uh saving money because engineers are one of the largest expenses for uh uh modern tech companies.
- 06:45 SPEAKER_01 But the problem is is that um there's not Bitcoin centric code assistance out there.
- 06:50 SPEAKER_01 So if you go and ask existing um large language models like Copilot or something, very specific questions about Bitcoin script, mini script, uh liquid development.
- 07:01 SPEAKER_01 Uh sometimes they'll do okay, but often they they they they they they they they they they they just hallucinate or or fall flat on their face, they just make things up or they just don't know the answers.
- 07:10 SPEAKER_01 Uh and especially as this ecosystem grows, there'll be uh just a greater need for incoming engineers to get up to speed more quickly.
- 07:19 SPEAKER_01 Um, and that's what we're hoping to do is to help bring the thousands of new bitcoin engineers that will need to enable hyperbitcoinization uh and give them the tools they need to to do so quickly.
- 07:31 SPEAKER_01 Um yeah, and then also in in addition to the actual mini script code assistant, uh, we also have developed a liquid assistant, which helps to explain uh Bitcoin and liquid related documentation.
- 07:45 SPEAKER_01 Um instead of just spending time like trying to pour through lots and lots of documentation here, you could just quickly ask a question.
- 07:55 SPEAKER_01 You know, there's uh some existing tools that liquid developers use to interact with uh industry experts.
- 08:01 SPEAKER_01 Um, but that's just there's just limited time there uh for people to help out.
- 08:07 SPEAKER_01 And here's a tool that you can just ask at any time.
- 08:10 SPEAKER_01 Um instead of having to search for documentation, you just ask your question, you can get the answer in minutes uh instead of uh... uh the hours it might take to pour through uh documentation or wait for our assistance.
- 08:22 SPEAKER_01 Uh so we have those two, yeah.
- 08:23 SPEAKER_01 We have the liquid assistant and then uh the code Satoshi.
- 08:27 SPEAKER_01 So right now um we've built our first model that specializes in mini script.
- 08:32 SPEAKER_01 It can help you to write code, it can help you to correct code, and it can help to explain the code, and uh we'll demonstrate all that uh uh uh shortly.
- 08:41 SPEAKER_01 Uh but this could obviously be expanded to additional uh... uh languages uh such as uh Bitcoin script specifically or or other things in the Bitcoin community that we could help with.
- 08:51 SPEAKER_01 For now, we just started with uh a mini script assistant.
- 08:55 SPEAKER_01 Um, it's one of the more simple languages that uh uh we could start with.
- 09:01 SPEAKER_01 So yeah, let's hop into uh a demo real quick.
- 09:05 SPEAKER_01 I will share my screen.
- 09:08 SPEAKER_01 Okay, so this is our uh... uh chat interface.
- 09:12 SPEAKER_01 As you see, we have a few different login methods.
- 09:14 SPEAKER_01 You can log in with your email, just simple straightforward.
- 09:17 SPEAKER_01 We also have uh uh login with lightning and also with nostalgia.
- 09:22 SPEAKER_01 So we'll go ahead and demonstrate uh logging with the lightning.
- 09:27 SPEAKER_01 Obviously, you won't see me scanning here, but I'm just taking my lightning wallet and scanning the QR code, clicking sign in.
- 09:35 There we go.
- 09:37 SPEAKER_01 And yes, actually, everyone, we are live on Geis here doing a crowdfunding campaign.
- 09:42 SPEAKER_01 So you can pick up some cool Bitcoin trading cards or uh a new book that was written by AI, which is really cool.
- 09:49 SPEAKER_01 So go check that out.
- 09:50 SPEAKER_01 Uh using those fonts.
- 09:50 SPEAKER_01 Go check that out.
- 09:51 SPEAKER_01 We're using those funds to actually help our contributors in the lightning reward uh... uh training tool.
- 09:58 SPEAKER_01 So those will all those funds will be used to help train Satoshi.
- 10:01 SPEAKER_01 So go check it out.
- 10:03 SPEAKER_01 Uh but here, yeah, here's our chat interface.
- 10:05 SPEAKER_01 Uh pretty straightforward, you know, similar to existing large engines models you know.
- 10:10 SPEAKER_01 Uh here's some suggested prompts.
- 10:12 SPEAKER_01 Here's your your input field on the left here.
- 10:15 SPEAKER_01 You can see we have our different models.
- 10:17 SPEAKER_01 So this is the main uh uh education model Satoshi, and then we have our code assistant and our liquid technical assistant.
- 10:24 SPEAKER_01 Uh so I'll just be shortly uh doing a short demo of the technical assistant, and then I'll have uh Bruno demonstrate the code assistant.
- 10:32 SPEAKER_01 So yeah, so it's pretty straightforward.
- 10:36 SPEAKER_01 You just type in uh your prompt here.
- 10:38 SPEAKER_01 So let's just pick one of these.
- 10:40 SPEAKER_01 How do I perform a liquid swap and shoot it out.
- 10:46 SPEAKER_01 So right now it's uh just going to the model.
- 10:49 SPEAKER_01 The model is gonna you know generate a response.
- 10:51 SPEAKER_01 Right now we don't have streaming enabled, so we'll just take a second here for the whole uh... uh response to generate and to fill.
- 10:59 SPEAKER_01 Uh but what's cool is um this is all polling from existing documentation uh uh that we've sourced across the internet, and you'll see in the responses it will actually provide you hyperlinks where you can go and learn more about uh... uh the the information provided and check that out.
- 11:20 SPEAKER_01 So here's a response tells you how to perform a liquid swap using liquid swap tool or GDK or Liquid X, and then here you can see the different source material.
- 11:31 SPEAKER_01 So I can just click on that and go check out where the information that I got was.
- 11:37 SPEAKER_01 So here we're seeing the developer documentation portal about swaps, so I can go and dive in and read more without having to like click through everything, it can just link you right there.
- 11:48 SPEAKER_01 Um
- 11:49 SPEAKER_01 So yeah, we have uh a few different prompts they gave here.
- 11:52 SPEAKER_01 So for example, can you give me some technical instructions on how to unblind liquid confidential transactions?
- 12:00 SPEAKER_01 So you know, same thing here, gives you a breakdown of how to do it, gives you some source material.
- 12:06 SPEAKER_01 And then we also have this feedback here.
- 12:09 SPEAKER_01 So we'd love to, you know, anyone who's using the tool to give us feedback if there's some answers that are weak.
- 12:13 SPEAKER_01 Uh we'd love to hear about that.
- 12:15 SPEAKER_01 And this will also assist um, you know, the blockstream uh community at large to help us discover areas where documentation may be weak.
- 12:25 SPEAKER_01 Uh so that can be addressed.
- 12:28 SPEAKER_01 Uh just another quick prompt here.
- 12:31 SPEAKER_01 Uh this is kind of just a more random factoid setting the precision metadata value.
- 12:36 SPEAKER_01 So it can even get into some of the detailed weeds here to help people out.
- 12:40 SPEAKER_01 Um or just getting started, like, hey, I want to get started installing elements.
- 12:45 SPEAKER_01 Cool.
- 12:45 SPEAKER_01 We'll just give you a quick breakdown.
- 12:48 SPEAKER_01 Um, give you instructions on how to do it and let you know where else you can go.
- 12:55 SPEAKER_01 So yeah, that's the uh the technical assistant.
- 12:58 SPEAKER_01 Uh let's hop over to Renault for uh the code assistant.
- 13:05 SPEAKER_02 Hey guys, glad to be here.
- 13:07 SPEAKER_02 So let me share my screen here.
- 13:09 SPEAKER_02 Just a moment.
- 13:11 SPEAKER_02 Uh I believe this is one.
- 13:15 SPEAKER_02 Okay.
- 13:16 SPEAKER_02 Can you can you see right?
- 13:19 SPEAKER_02 So um just like the technical assistant, the code assistant.
- 13:24 SPEAKER_02 Uh we use the same interface.
- 13:26 SPEAKER_02 So as you can see, there's some examples here, like, for example, what is a mini script policy?
- 13:32 SPEAKER_02 Some people uh start uh want to learn about mini script, and the the okay, what is actually a policy?
- 13:41 SPEAKER_02 So there you can, for example, teach me some mini script basics.
- 13:45 SPEAKER_02 So you can go from from
- 13:46 SPEAKER_02 So you it can go from from further down from the absolute basic level in it can build the knowledge with you.
- 13:56 SPEAKER_02 So let me share some that I already have here.
- 14:00 SPEAKER_02 So write a mini script policy where11 or15 multi-sig can spend at any time, or an emergency key can spend after4030 uh32320 blocks.
- 14:17 SPEAKER_02 This is basically what liquid uh how liquid works.
- 14:22 SPEAKER_02 It's a multi-sig11 of15, and after30 days, which is roughly this size, it's the you can an emergency key can try to do something.
- 14:34 SPEAKER_02 So it brings you the mini script policy.
- 14:38 SPEAKER_02 I study it, so I'm I'm sure it works.
- 14:43 SPEAKER_02 Not only uh it works, but it compiles.
- 14:47 SPEAKER_02 So this is the policy, which is uh what is a policy is uh the behavior that you want the mini script uh the mini script to have, but uh the mini script there is also the code, which is uh you can you can build uh a multi-sig in uh using seguit, you can build it using tap roots, so you can have two different kinds of uh code that provide uh two codes that provide the same policy.
- 15:20 SPEAKER_02 So here it compiles to this code here, and it compiles also to this bitcoin script.
- 15:28 SPEAKER_02 So if you know bitcoin script, you can take a look to check, but if you don't know, you can learn how uh... uh bitcoin script here.
- 15:37 SPEAKER_02 You can uh ask in plain English how to write something in mini script, and now you can have something. in Bitcoin script, so you can uh understand what's happening, and also there's a spending cost analysis, which if you are actually using in production, you might want to try some different different things to have a lower weight unit, and uh in the end here we have a really small we are working on it to make it larger for these stream cases, but um it's a flow chart where you can it's it can become really clear what's happening.
- 16:21 SPEAKER_02 So there's15 um keys here, and here you cannot read it right now, but it's written.
- 16:29 SPEAKER_02 Let me see if I can I can yeah, it's a little bit bigger.
- 16:34 SPEAKER_02 So it's check11 out of15.
- 16:38 SPEAKER_02 So any of these11, uh you can you can use it, and then check one out of two.
- 16:45 SPEAKER_02 So you have uh or this this one or this one, this or this11 ones, or this check two out of two.
- 16:54 SPEAKER_02 So we need the emergency key older than4320 blocks.
- 17:01 SPEAKER_02 If you get any of those paths, okay, you can spend if no nothing.
- 17:06 SPEAKER_02 So you can uh even though there is a really lots of things written down here, it's really hard to read what it's written because uh it's kind of verbose of how many keys.
- 17:21 SPEAKER_02 You can actually take a look here in the flowchart, and it's uh and now it's really clear what's happening.
- 17:30 SPEAKER_02 So, for example, uh this is for for liquid, but let's say you don't know liquid, you don't know mini script.
- 17:37 SPEAKER_02 You... you you're just starting like what is a hash lock, and how can I create one in mini script?
- 17:43 SPEAKER_02 So here.
- 17:42 SPEAKER_02 Only mini script.
- 17:43 SPEAKER_02 So here there's a it's an explanation.
- 17:45 SPEAKER_02 Hash lock is a type of smart contract mechanism, and uh there's kinds of different kinds of hashes that you can use for hash lock, and uh this is the function.
- 17:56 SPEAKER_02 It there is a compile error because this is not actually the mini script policy, this is just the function.
- 18:03 SPEAKER_02 This is uh an example mini script policy.
- 18:06 SPEAKER_02 So you can pay if you pay if you have a key and also a hash, and uh so you have the mini script code and the bitcoin script code, and there is um uh uh flow chart here that is uh you can you can visualize what's what's going on, and uh you also can get something on online, for example, someone writes something in you and you but it's not compiling, and there's something wrong.
- 18:39 SPEAKER_02 So, what is wrong with this mini script policy?
- 18:41 SPEAKER_02 And you write it here, and uh it's it says here the end function actually only takes two parameters.
- 18:50 SPEAKER_02 There's three parameters here, so you can fix it using and nested and and inside an end.
- 18:57 SPEAKER_02 So this is the correct mini script policy, the mini script code, the bitcoin script, and this is what it compiles to is essentially a three out of three multi-sig.
- 19:08 SPEAKER_02 Uh so you can also get something and ask it to explain.
- 19:13 SPEAKER_02 So there is this really complex mini script here.
- 19:16 SPEAKER_02 Uh, and okay, you say you ask to explain, and you oh and it says okay, this is a hash time lock contract.
- 19:27 SPEAKER_02 Use the enlightening network, both three.
- 19:29 SPEAKER_02 This is a breakdown of the policy.
- 19:31 SPEAKER_02 So there's the key revocation, and then it goes inside every branch and explain everything.
- 19:39 SPEAKER_02 So basically, this is it.
- 19:41 SPEAKER_02 We
- 19:40 SPEAKER_02 Well basically this is it.
- 19:41 SPEAKER_02 We still have uh some work to do.
- 19:44 SPEAKER_02 This is alpha version, but it it already works really well.
- 19:48 SPEAKER_02 There's uh lots of uh uh uh there's lots of use cases that uh uh it suits really nicely.
- 19:56 SPEAKER_02 And uh yeah, we just like the other one, we can copy this and use it to send you someone or or something, or then you can uh put uh thumbs up or thumbs down if you like it or not.
- 20:12 SPEAKER_02 So this is it.
- 20:18 Awesome, thanks, Brenda.
- 20:19 SPEAKER_01 Yeah, like Breno said, this is uh kind of an alpha version.
- 20:23 SPEAKER_01 We'd love to get people to come in, try it out, uh help us figure out you know what areas are good, what areas are weak, how we can improve it, and uh obviously expand to additional uh... uh languages and protocols to help the Bitcoin community at large.
- 20:38 SPEAKER_01 Um so yeah, everyone who's here, we'd love to have you join us.
- 20:42 SPEAKER_01 Um reach out if you want to participate in the... the alpha and we'd love to get your feedback.
- 20:53 SPEAKER_03 Nice one uh uh uh thanks for the presentation, guys.
- 20:57 SPEAKER_03 Uh community or anyone in the live stream have any questions for them.
- 21:05 SPEAKER_02 Yeah, if you want also to test something, like uh is there a kind of mini script that you want to test, I can put it here and see if it works.
- 21:18 SPEAKER_02 Give an answer.
- 21:24 SPEAKER_03 I have a uh... uh non code related question.
- 21:27 SPEAKER_03 Yeah.
- 21:28 SPEAKER_03 That that I think uh uh one of the things you mentioned earlier in the presentation was that uh uh uh it you know it was able to um make predictions.
- 21:41 SPEAKER_03 And uh you mentioned the uh uh uh having date.
- 21:46 SPEAKER_03 Uh maybe we can play around with that and the code assistant.
- 21:50 SPEAKER_03 Just uh you know, because it is kind of changing between the19th and the20th.
- 21:55 SPEAKER_01 Just kind of uh okay, especially GPT.
- 21:58 SPEAKER_01 Yeah, yeah.
- 21:59 SPEAKER_01 Let's see what he says.
- 22:00 SPEAKER_01 Let's see what he says.
- 22:01 SPEAKER_01 All right, all right.
- 22:03 SPEAKER_03 And and does it uh I mean does it do uh price predictions as well?
- 22:08 SPEAKER_01 Um I actually don't know.
- 22:10 SPEAKER_01 I've I've never asked it though.
- 22:12 SPEAKER_01 I mean I'm sure it's just gonna hallucinate something, but who knows?
- 22:15 SPEAKER_01 Yeah, yeah.
- 22:17 SPEAKER_03 Maybe ask if uh let's see if you know ask it a specific question that uh is kind of relevant right now.
- 22:27 SPEAKER_03 Uh maybe how low will Bitcoin go in this current dip, or something like that.
- 22:34 SPEAKER_03 When is the most opportune time to buy during the dip?
- 22:38 SPEAKER_01 I don't know.
- 22:39 SPEAKER_01 I'm just playing around.
- 22:40 SPEAKER_01 Let's uh hold on, let me get into it one sec.
- 22:46 SPEAKER_02 Just to just ahead the heads up, probably this won't work.
- 22:51 SPEAKER_02 This is even too hard for humans.
- 22:54 SPEAKER_02 So for he's probably going to just ellucinate something or or say that it doesn't know.
- 23:01 SPEAKER_02 But um yeah, it's it's fun to play with this as well.
- 23:08 SPEAKER_03 Yeah, I didn't mean to uh derail the uh conversation or thing, it's just uh I thought it would be a little fun to experiment with it.
- 23:16 Yeah, it's fun.
- 23:20 SPEAKER_01 Hey, there you go.
- 23:21 SPEAKER_01 Predicting the price is for akin to forecasting the path of a leaf in the wind.
- 23:27 That's pretty funny.
- 23:29 SPEAKER_01 Pretty true, though.
- 23:30 SPEAKER_01 Yeah, yeah.
- 23:30 SPEAKER_01 Who knows, right?
- 23:36 SPEAKER_03 Okay, so it's just being uh uh uh yeah, it's just like acting like your mother or something, like oh you couldn't drink uh uh some good advice though.
- 23:46 SPEAKER_01 Don't focus on the structure.
- 23:50 SPEAKER_03 Okay, so so maybe uh ask it more specifically about the having date.
- 23:55 SPEAKER_03 Like what is the um let's see, yeah.
- 24:08 SPEAKER_01 So it's pulling some data right now.
- 24:11 SPEAKER_01 Okay, so you said uh what about ask it a specific time or you know what I mean?
- 24:28 SPEAKER_03 Like what uh I never mind again, I guess it gives you the uh the block.
- 24:32 SPEAKER_01 Because you're the block has some of it, yeah.
- 24:37 Okay, okay.
- 24:42 SPEAKER_03 Another question.
- 24:44 SPEAKER_03 Um you mentioned uh the inherent bias in all of uh uh you know when you're dealing with LLMs.
- 24:50 SPEAKER_03 So yeah, yeah.
- 24:51 SPEAKER_03 Uh for example, yeah.
- 24:56 SPEAKER_03 Like recently with Gemini, yeah.
- 25:00 SPEAKER_03 Would be a good example of of that inherent bias.
- 25:03 SPEAKER_03 So how like how have you guys trained on the back end the Satoshi AI to be non fiat?
- 25:12 SPEAKER_03 Yes, yes, so like a specific strategy there, or what?
- 25:16 SPEAKER_01 Great question.
- 25:17 SPEAKER_01 Um yeah, so it's actually been the the greatest challenge we face because um so we don't have the resources to train a foundation model just because that requires I mean literally millions of dollars um to build even even a small foundation model is what they're called.
- 25:34 SPEAKER_01 Foundation models what they're called is uh... uh building it from scratch, basically, right?
- 25:39 SPEAKER_01 So this that's what OpenAI has done, that's what Meta's done.
- 25:43 SPEAKER_01 Um that requires a very, very significant amount of resources, uh lots and lots of GPUs, um, and huge training data sets.
- 25:52 SPEAKER_01 So what we've uh have instead done is uh taken an existing foundation model and then performed continued pre-training and fine-tuning on that model.
- 26:02 SPEAKER_01 Uh and one of the challenges is is that a lot of that bias is kind of baked into the foundation model based on the previous training data.
- 26:08 SPEAKER_01 A lot of that comes from you know just the internet at large, uh Wikipedia, you know, books, YouTube transcripts, etc.
- 26:16 SPEAKER_01 And so it basically just inherits you know the general bias in the data sets used to create the foundation model.
- 26:23 SPEAKER_01 So uh the challenge is is getting that out.
- 26:27 SPEAKER_01 Um so what we've done is compiled our own data sets, uh, much of that coming from uh uh what's called the Nakamoto repository.
- 26:36 SPEAKER_01 So if you go to our site, um there's a link where you can go and view that.
- 26:42 SPEAKER_01 Uh if you go to the get involved, um share my screen real quick.
- 26:47 SPEAKER_01 Uh show you so here on our site.
- 26:52 SPEAKER_01 Uh if you click on the get involved tab, uh submit data.
- 26:56 SPEAKER_01 So this will take you to the Nakamoto repository.
- 26:59 SPEAKER_01 Uh this is just uh repository of all the Bitcoin uh works, whether it's a website, an article, a book, uh video.
- 27:08 SPEAKER_01 Uh this has been contributed from the community.
- 27:11 SPEAKER_01 Um so this will be a lot of the resources that were used in that.
- 27:14 SPEAKER_01 Uh also we uh uh partnered with uh Mises.org
- 27:19 SPEAKER_01 and they shared with us their uh extensive repository of Austrian economic works, which is fantastic.
- 27:26 SPEAKER_01 And so uh we tried just uh uh initially doing some fine tuning, which is to create um some Q.
- 27:32 SPEAKER_01 Create um some QA pairs based on these sources, and then do a fine tuning on the model, but it just wasn't sufficient to remove some of those biases.
- 27:42 SPEAKER_01 So we actually had to perform what's called uh continued pre-training, which is where you go back uh into the deeper layers of the model and you just uh uh train it to predict the next word, um using just plain text, so just chunks of plain text data from the sources, and that has really uh... uh helped kind of take it to the next level and get some of those biases out, but it's actually lots and lots of data that is um aligned with your the the bias that you're trying to train into the model and structuring that and and trying to train the model out.
- 28:19 SPEAKER_01 Bruno, you have any other input there.
- 28:24 SPEAKER_02 Um you you sum up really really well, good.
- 28:30 SPEAKER_03 Yeah, it's interesting.
- 28:31 SPEAKER_03 It... it you know it... it parallels with the uh whole orange pilling meme, right?
- 28:37 SPEAKER_03 You have to kind of uh... uh forget your preconceived notions, yeah, and... and re-learn things three, four, or five times to really grok and understand Bitcoin.
- 28:49 SPEAKER_03 So it's it's funny how you know even with this uh AI language model that it's it's essentially the same uh uh uh you know strategy here.
- 29:01 SPEAKER_02 Yeah, actually uh going going deeper into this this uh in language models is there's a huge overlap.
- 29:10 SPEAKER_02 Uh deep learning networks, they they first appear to mimic how our brains work.
- 29:18 SPEAKER_02 So in nowadays that uh this started on the50s.
- 29:23 SPEAKER_02 So it's really it's really old.
- 29:26 SPEAKER_02 Since we we call like neural net neural networks because they build uh a neuron what they call a neuron is a it's a simulation of what they think a neuron do more or less so right now when we have these larger models they they they use these models also in in um in neurologic and science they they they they they they they they they they they they they they they try to they they they they they they they they they they they the the engineers they try to learn uh neurology to apply new concepts to the... the large language models and other kinds of models and uh some some some some some some some some some some some scientists goes backwards as well they they they try to uh make some tests in in in models and in neural network and uh to see if if it makes sense or because it's really easier like you have a computer you can do whatever you want you can like do a brain surgery a brain scan and it's way cheaper than doing this in a in an actually actual human or human being so they there's these overlaps and and uh for the the forgetting phase is a really hard thing it's it's a really hot topic right now people actually don't know how to make a model actually forget uh they they they don't have a uh a uh a uh a uh... uh closed answer for this everyone in the world right now be the person working on open ai or ometa or whatever company they are working on they do not have uh a way to make the model forget so so sometimes it memorizes something and uh it could be all it could always be there and sometimes it appears out of no where
- 31:28 SPEAKER_02 Peers out of nowhere when you're not expecting.
- 31:31 SPEAKER_02 Of course, you can try to steer the model to what you want as most you can.
- 31:37 SPEAKER_02 This is what we are doing.
- 31:40 SPEAKER_02 But there's no there's no safe guarantee right now that the model actually forgets uh some wrong information that it learned before.
- 31:51 SPEAKER_03 Is this um is this what people like attribute to degradation?
- 31:58 SPEAKER_03 Like uh, you know, I've you know, of course, I've been following the AI space since uh Chatby T chat GPT was released, and a lot of people are suggesting that it's gotten dumber, or or it's not as powerful, or you know, it is this due to it falling back to its uh you know like quote unquote original memories, or... or are those um yeah, kind of what you describe in the scenario.
- 32:27 SPEAKER_02 Yeah, not exactly.
- 32:29 SPEAKER_02 Not exactly.
- 32:30 SPEAKER_02 Uh can be, but the thing is uh the a neural network is it's a highly complex thing that we... we don't know how to predict exactly what is going to come out.
- 32:43 SPEAKER_02 There's uh lots of these individual neurons.
- 32:46 SPEAKER_02 So when you when you hear uh model is7b, there's seven billion neurons, and uh these neurons are connected to each other in lots of different ways.
- 32:59 SPEAKER_02 And uh so when you when you try when you ask the model something, it it goes through this network of neurons, and it... it spits an answer, and uh so when it degrades, it's it means it means that um the model the the the performance metrics that they have are not great anymore.
- 33:26 SPEAKER_02 So it's really hard like
- 33:26 SPEAKER_02 So it's really hard.
- 33:27 SPEAKER_02 Like, how can you make sure that the model um is good or not when you're talking about language?
- 33:34 SPEAKER_02 When you're talking about like uh doing any specific test task is really easy.
- 33:41 SPEAKER_02 You can okay, if achieves this task, and how well you can achieve it's it's okay, but when it's language, you can do almost anything.
- 33:50 SPEAKER_02 It's uh it's really an open thing.
- 33:53 SPEAKER_02 So you have some metrics, for example, trying to make a school test, for example, answer uh uh a test about mathematics, about history, about common knowledge, or something like this, and uh evaluate with humans, which is really hard.
- 34:12 SPEAKER_02 And uh so when it degrades, uh it's it means that the the these metrics are getting worse.
- 34:21 SPEAKER_02 So they are sometimes they they can be forgetting, sometimes they they they have some unexpected things, just like, for example, when a human uh... uh learns about bitcoin, it starts to learn about lots of other stuff together with bitcoin.
- 34:38 SPEAKER_02 So there's lots of people that learn about bitcoin, and soon after they they they they they they they they they they they they became uh Christian and they they they they they start a family and they change their worldview, and they start being carnivore eating only meat, because sometimes one information in one place can uh change how some person can see other kinds of information, and this can happen with AI also.
- 35:05 SPEAKER_02 Every time that we are uh... uh training adding more information, uh, we can sometimes forget or corrupt the past information that model already have, or maybe even change it, uh giving a whole new perspective that we were not uh predicting it would have.
- 35:24 SPEAKER_01 uh uh uh predicting it would have makes sense it was it was super interesting after uh... uh actually just recently we did a another training uh experiment and asked it some uh questions that we didn't necessarily train it on but it surprisingly answered very well even though we couldn't find any thing in our data set that was actually associated with that topic so it is that's why it's a very iterative process because it's some of it is uh more art than science and you really have to experiment and just see what happens so it's so the something very surprising this was with the uh bitcoin language models that you're working on yeah yeah we're working on a new model uh uh uh another version uh that we're hoping to release actually uh into this month so yeah let's let me give you an example uh quick example so if you if you train the model in in uh large uh data so the model will have uh for example we'll know how to make a bomb and you go to chat GPT and you ask how how how do I make a bomb and uh it it's it's not going to say say oh this is this is not safe I cannot tell you and blah blah blah.
- 36:37 SPEAKER_02 So why why that that is because there's a training and then there is the fine tuning step where it uh the open AI team try really hard to keep the model let's say safe.
- 36:50 SPEAKER_02 So uh if if we uh uh train more and try to give uh the model other kinds of uh fine tuning for example explaining some things that the... the idea of safe is not there uh... uh like for example I I I explain how to I don't know kill a chicken to make uh lunch so if... if uh the model understands this as unsafe because you're killing uh living being it's uh in and... and then you but you train it to to always give an answer how to kill a chicken to make lunch, for example.
- 37:37 SPEAKER_02 Uh in the end, the model can uh uh have this concept of okay, I don't need to be that safe anymore.
- 37:47 SPEAKER_02 Uh uh, I don't need to be uh uh to only give safe uh answers.
- 37:51 SPEAKER_02 So even though I'm not saying about anything about bombs, uh just by instructing making the model uh... uh training the model to provide information that it can consider unsafe, it maybe releases it to provide other unsafe information as well.
- 38:15 SPEAKER_03 Well, I I don't know if this was you know, this is glossed over this aspect of AI language models, but I mean I was still under the assumption that things were very A to B, you know, input output, that there wasn't any kind of intuitive learning happening in the background.
- 38:32 SPEAKER_03 But it sounds you know I don't know if you guys can go deep into that example you gave where you were talking with the Bitcoin language model, and uh you asked it, I'm assuming about like adjacent cultural references like carnivore or whatever, and it gave you like yeah, that's how I would like have done the experiment, too.
- 38:51 SPEAKER_03 Uh uh, and so I'm just thinking out loud here, but um so yeah, you I'm assuming yeah, you asked it some culturally, you know, um similar questions, and it gave you information that is uh already kind of um you know propagating in the Bitcoin space, like you... you know, like I was saying, in terms of uh religiosity and uh and and uh diet and health, yeah.
- 39:19 SPEAKER_01 So for example, um uh trying to.
- 39:20 SPEAKER_01 Example, um trying to remember what the precise one was, but I believe it was about soy or something.
- 39:24 SPEAKER_01 It was like about soy healthy or something like that.
- 39:27 SPEAKER_01 And uh yeah, it... it answered exactly like you'd expect a bitcoiner to and then I said I don't remember us like ingesting sources about soy and so we went checked and I think there was like two examples in the entire data set of you know tens and tens of thousands of them.
- 39:42 SPEAKER_01 And so surprisingly like and and they weren't very specific either.
- 39:46 SPEAKER_01 Um so it's just very surprising that it was able to inherit such a an aligned world view with so little training data.
- 39:54 SPEAKER_03 Yeah, so it was like offhand remarks and an article that was part of the resource space or whatever, and it holds from that.
- 40:01 SPEAKER_03 Interesting.
- 40:03 SPEAKER_01 Sometimes they can learn even from just one or two examples, which is generally speaking, you'd want to have thousands of examples about soy if you wanted it to answer stuff about soy.
- 40:13 SPEAKER_02 So yeah, we... we have this idea that is just input and output because uh okay the answer like what what's being trained on, but actually it kind of makes uh an internal model of what's going on.
- 40:28 SPEAKER_02 For example, uh in our in our space we have three three dimensions, right?
- 40:34 SPEAKER_02 We have uh we have length, we have uh height, and uh depth.
- 40:39 SPEAKER_02 So we have three dimensions, and and we usually think about we... we it's hard for us to understand more than three dimensions, but when we passed when when we give uh something to the to the model to a language model, it breaks into tokens, which is uh words or a smaller part of a word.
- 40:59 SPEAKER_02 For example, if I gave the word cars, it's two tokens.
- 41:03 SPEAKER_02 One token is car, and another token is S, which is we so there's uh two tokens in in then it makes uh it it it the the process of tokenizing it transforms these uh
- 41:18 SPEAKER_02 forms this uh this token into a vector of uh... uh almost eight hundred dimensions so so uh... uh the model creates and tries to understand dimensions of this token but it's not uh spatial dimensions it's um um it's semantic dimensions so it's same it's dimensions of meaning so for example there's a classical uh explanation that if you get uh the... the token for the token king and subtract mathematic mathematically the... the uh king minus man plus woman equals queen so so you you you you have this uh this is just an example it's not exactly like this but this is uh helps understanding um um that it transforms it gets uh a a word or uh uh uh or uh a text and transforms into several vectors with eight hundred dimensions and uh and it tries to capture everything there is to capture of meaning inside these dimensions so it can uh as they relate to each other and so... so sometimes uh there's something that is not actually written there we didn't give as input but it learns because of the relations with other other token well thanks for going into the weights yeah yeah in the I mean I'm I'm quite new to the you know I I mean I've never looked up exactly how you know behind the scenes the uh you know tokenization process works or um you know, just yes, I had an idea that these language models were replicating brain synapses and networks, but um I was always skeptical of how much you know like intuitive creative thinking is happening.
- 43:33 SPEAKER_03 Because even the example you all use, it sounds like it's pulling just from a you know it it's still pulling from the data set, but you were surprised that it elevate some things within the resource data or whatever, because they were trained on so much more.
- 43:51 SPEAKER_03 Um but you know you know this brings up ethics and and a whole load of uh uh of questions with Mandora's box.
- 44:01 SPEAKER_01 Um yeah, well, that also that also is uh that's an important point because it it also means that we don't want AI to be completely captured by you know certain organizations that have biases or belief systems that are antithetical to ours, right?
- 44:18 SPEAKER_01 So we want to have other alternatives, we want to have a competition in the marketplace, um, so that it's not just completely controlled by these entities.
- 44:26 SPEAKER_01 For sure, and I and I think that's playing out in real time, right?
- 44:29 SPEAKER_03 Yeah, yeah.
- 44:32 SPEAKER_03 What was the X version?
- 44:33 SPEAKER_03 Uh Groc, and then um Google's Llama, that's correct, and then open, yeah, Meta's lama.
- 44:41 Yeah, yeah.
- 44:42 SPEAKER_01 And uh Mr is another one and out of Europe that's doing some good open source work.
- 44:48 SPEAKER_03 So yeah.
- 44:50 SPEAKER_03 Um I had an idea when you when we were talking about this.
- 44:54 SPEAKER_03 Um it would be funny to create a language model that's like incredibly bitcoin toxic, you know what I mean?
- 45:03 SPEAKER_03 Like the personification of of a uh Bitcoin maximalist.
- 45:07 SPEAKER_03 Kind of like how Grok is um, you know, how they've tried to market Grok in the space right now is that it's a little bit
- 45:14 SPEAKER_03 rock in in in the space right now is that it's a little bit more silly it's it's lighthearted yeah but with the understanding that it's not going to be censored as definitely but yeah uh do you all have ideas for I mean maybe maybe you don't want to spill the beans here in uh in a public forum but I'm sure you'll have lots of ideas on what you can do in the future in the Bitcoin space.
- 45:38 SPEAKER_03 Yeah toxic mode is what we're calling it oh okay so that you can yeah it's not here yet but yeah it's a it's a great idea I agree cool well if there's I don't think there's any more questions in the chat we kind of went on a tangent here at the end but um really fascinating stuff and thank you for both joining and and letting us uh quickly circle back yeah we'd love to have everyone come out uh to our website sort of satoshi.ai
- 46:06 SPEAKER_03 uh give satoshi a try um if you're interested in joining the alpha for either the minuscript assistant or the technical assistant um please reach out to us uh on our website or on Twitter um and we would love to get you access and get your feedback on that cool I think um as a follow up here um maybe useful to create a thread on bowl two and then uh people who watch this you know later on can can go there and can ask you questions about uh the new toxic mode that's coming all right guys awesome thanks for joining thanks everyone thanks for joining we'll see you guys