Alex's Notes

AI Is the Next Great Interop Layer

I had previously observed that humans are the great interop layer—we are the glue that fits together disparate processes and tools into usable systems. After using large language models, I’m becoming convinced that they can offload a large amount of the interop cost that currently falls to us. In a nut shell, AI can ‘do what I mean not what I say’ pretty darn well.

The latest LLM tools can interpret what we mean far better than before. Natural language interfaces opens up the possibility of sloppy interfaces—ones where not everything needs to be specified precisely. This allows for tolerance in the system making it easier for more things to fit together.

By contrast, APIs are largely a ‘do what I say’ interface. They require precise steps to complete an action. They require documentation to help a human understand how to use them. It takes creativity to implement them in ways that can solve their problem. Now we have LLMs that can figure out how to make API calls (Zapier NLAPI) and map different actions to different methods (langchain tools).

Published Apr 17, 2023

💬 Comments 🔗 Copy Link

Layering vs Chunking

When building large complicated things, there are two primary strategies that optimize for different things—layering and chunking.

Chunking is about taking something large, breaking it up into smaller pieces, and assembling them into one cohesive thing. This is pretty common and intuitive which pairs nicely with “divide and conquer” and can speed things up but tends to be all or nothing (either all the chunks are delivered or nothing ships).

Layering is also about breaking something large down but in a way that accumulates. The overall project is divided into smaller layers that are independently useful and shipped. This makes it more iterative, delivering value along the way and providing opportunities to adapt to new information.

Layering is much more desirable for large complicated projects because it’s incremental.

Problems arise when teams confuse chunking for layering—the team might think they are being fast and incremental, but deliver no value to the user until the end anyway.

I’ve observed many large-scale projects fail because they used the wrong strategy or because no one recognized that chunking was the only option.

To help spot these issues, a simple clarifying question to ask is: “At what point can a user actually use this?” (This can also be adapted for internal infrastructure projects, “At what point will this deliver value?").

(I read about this somewhere and apply it all the time but I can’t find the source, sorry!)

Published Apr 16, 2023

💬 Comments 🔗 Copy Link

Prompt Injection Attacks Are Unavoidable
While large language models are already useful for certain text based tasks, connecting them to other systems that can interact with the outside world poses new kinds of security challenges. Because it’s all based on natural language, any text can effectively become untrusted code.

Some examples:
- Adding a prompt injection attack on your public website that can be accessed by an LLM enabled tool like Bing search
- LLM-enabled personal assistant that can read your email might be prompt injected simply by sending them an email
- Data could be exfiltrated from a support ticketing system by sending a prompt injected message
- Training data might be poisoned by including prompt injected text that via hidden text
It’s unclear what the solution to these problems are. You could chain together more AI tools to detect prompt injection attacks. You could build protections into the prompt used internally. You could warn the user or log a message for every action taken and use anomaly detection.

Read Prompt injection: what’s the worst that can happen?
Published Apr 15, 2023

💬 Comments 🔗 Copy Link

Startup Multiples Fell From 100 to 39 Times ARR in 2023
According to a recent presentation by Redpoint, ARR multiples for Series B and C companies dropped sharply from the all-time-high in 2022 (100x) to a more sober 39x, representing a 4x premium compared to public markets.

See also:
- In public markets, revenue multiples fell from 24x in 2021 to 10x in 2022
- Running a startup during a recession
- If there continue to be as few deals getting done, the backlog of startups that raised in 2021 and 2022 will run out of cash starting end of 2023
Published Apr 14, 2023

💬 Comments 🔗 Copy Link

Natural Language User Interface

One of the super powers of large language models is that it can “do what I mean” instead of “do what I say”. This ability to interpret prompts can drastically lower the barriers to accessing and interoperating between systems. For example, writing “Send a slack message to the Ops channel with a list of customers from HubSpot that signed up in the last week” would generate actions that query the HubSpot Contacts API, parse and extract the results, and make another API request to Slack to post to the #ops channel.

The best example I can find of this is from Zapier’s Natural Language Actions (NLA) API. Zapier is perfectly positioned to enable a jump to universality for natural language UI because it already connects thousands of API providers together.

NLUI can also be useful as a frontend to a single service. For example, making a chat support bot that can access the content of support docs.

Published Apr 12, 2023

💬 Comments 🔗 Copy Link

Biologists Don't Make Good Medical Doctors
Deeply theoretical fields don’t necessarily translate to highly practical fields. Economists don’t typically do well as investors. Just because biologists know a lot about the inner workings of the human body, doesn’t mean they make great medical doctors.

Highly practical fields focus on effectiveness above all else. For example, you can still make a lot of money as an investor by having a flawed investment philosophy.

Highly theoretical fields focus on accuracy of an explanation. This seems to favor finding subtle niches to make contributions.

Read Why isn’t there a philosophy of business?

See also:
- Objective knowledge begins as conjecture and then corrected with criticism
- Abstractions are real and a medical doctor can be effective without knowing all the biology at a lower level
Published Apr 11, 2023

💬 Comments 🔗 Copy Link

Legal Services Has the Highest AI Occupational Exposure

A recent paper looking into the economic impact of large language models found that the legal industry has the most potential occupational exposure from AI including augmentation or substitution.

What is AI occupational exposure based on?

The AIOE measure was constructed by linking 10 AI applications (abstract strategy games, realtime video games, image recognition, visual question answering, image generation, reading comprehension, language modeling, translation, speech recognition, and instrumental track recognition) to 52 human abilities (e.g., oral comprehension, oral expression, inductive reasoning, arm-hand steadiness, etc) using a crowd-sourced matrix that indicates the level of relatedness between each AI application and human ability. Data on the AI applications come from the Electronic Frontier Foundation (EFF) which collects and maintains statistics about the progress of AI across multiple applications. Data on human abilities comes from the Occupational Information Network (O*NET) database developed by the United States Department of Labor. O*NET uses these 52 human abilities to describe the occupational makeup of each of 800+ occupations that it tracks. Each of 800+ occupations can be thought of as a weighted combination of the 52 human abilities. O*NET uses two sets of weights: prevalence and importance.

It seems a bit impractical to base the degree to which AI could disrupt an industry based on a list of occupational skills assigned to a category. Maybe it makes sense if you compare within the same score (telemarketers is on the opposite end of the spectrum from pressers).

Read How will Language Modelers like ChatGPT Affect Occupations and Industries

Published Apr 7, 2023

💬 Comments 🔗 Copy Link

Decoupling Work and Place

One of the silver linings of the COVID-19 pandemic is that work and place are decoupling is increasing the radius of economic opportunity. Previously, where you live and where you work were tightly coupled, but with the rise of remote work, more jobs can be done in more places.

When we look back in 10 to 20 years time, I’m confident we will see this decoupling as one of the most profound changes to the workforce since the internet. In many ways, remote work is a transportation technology that is only just beginning to transform where and how we live.

Published Apr 6, 2023

💬 Comments 🔗 Copy Link

LLM-First Programming Language

There are many barriers to adoption for a new programming language looking to go mainstream. You have to attract a small group of enthusiasts, build an ecosystem of high quality libraries, help new people learn, and eventually grow a talent marketplace.

Update: Someone built something similar! It’s not quite what I had in mind but it’s definitely a riff on the same idea.

The rise of LLMs adds another wrinkle—if programming with an AI assistant becomes commonplace, the degree to which the AI is good at the programming language could be a significant benefit (or detractor).

An LLM-first programming language might leverage AI assistants to make adoption significantly easier. I love the rust compiler because it is basically a self contained help system, but it requires study and practice to be productive. Imagine if you could become productive immediately and then learn more.

Not only that, an LLM-first programming language might have a deep repertoire of libraries, patterns, and problems it draws from that were specifically trained for language. I imagine this would be much more effective in day to day coding tasks than the aggregate output of blog posts that happen to be published on the internet and pulled into a blob of training data. It might even provide better context for other LLM applications to give more meaningful help (surely a compiler has some more precise data like the AST, the change, the types, etc that would be useful for figuring out what you are trying to do).

Published Apr 4, 2023

💬 Comments 🔗 Copy Link

ChatGPT Lowers Barriers to Building Small Projects
After using it for a few coding projects recently, I find that ChatGPT is a great way to lower the barriers to building smaller, self-contained projects—things that have been hiding in your to do list that take a bit too much effort to attempt but is still a good idea.

ChatGPT makes getting started easy. With a few prompts, you get semi working code to try out. Small projects don’t need much context, so there is very little context to explain (large amounts of context is a problem for LLMs). Using ChatGPT also offloads things you might not enjoy doing—writing documentation, product copy, or even writing basic tests.

I find pairing with ChatGPT to write code is ideal for internal tools and small projects to improve workflows. In just a few hours I was able to capture output of chatgpt-shell in org-mode, make an org export backend for Notion, and emacs sticky buffers—all I would have said is nice but not worth the effort.

See also:
- Knowledge capture loops make for good systems and ChatGPT speeds up each iteration
- ChatGPT is a great fit for one of one software like customizing your Emacs init file
Published Apr 3, 2023

💬 Comments 🔗 Copy Link

Capture Output of Chatgpt-Shell in Org-Mode

I’m messing around with ChatGPT in Emacs and I want to capture the results of a session in org-mode. Since chatgpt-shell is just a buffer (like everything else in emacs), we can write an elisp source block to grab the contents of buffer. Even better, we can tell org-mode to export the results as a markdown block since the code examples all use code fences (the triple backtick).

#+BEGIN_SRC emacs-lisp :results value :wrap src markdown :cache yes :exports results
(let ((buf (get-buffer-create "*chatgpt*")))
  (with-current-buffer buf
    (buffer-string)))
#+END_SRC

ChatGPT> You are an elisp programming expert assistant.
<gpt-end-of-prompt>
Great! How can I assist you today?

ChatGPT> How do I use org-mode to run a block that contains the output of another buffer? For example, there is a library that uses comint-mode as a chat interface. I want to embed that into an org-mode document, ideally using org-mode blocks. How do I do that?
<gpt-end-of-prompt>
You can use the `src` code block in org-mode to execute commands and capture their output. Here's an example:

First, you need to define the comint-based chat interface as a function that returns its output as a string. Here's a basic example using `comint-send-string` and `comint-show-output`:

```
(defun chat-interface ()
  (let ((buf (get-buffer-create "*chat*")))
    (with-current-buffer buf
      (erase-buffer)
      (comint-mode)
      (comint-send-string (get-buffer-process buf) "Hello, World!\n")
      (comint-show-output))
    (with-current-buffer buf
      (buffer-string))))
```

This function creates a new buffer called `*chat*`, switches to it, sends a message to the chat interface, shows its output, then returns the buffer's content.

Next, you can use this function in a `src` code block in org-mode like this:

```
#+NAME: chat-example
#+BEGIN_SRC emacs-lisp :results output
(chat-interface)
#+END_SRC
```

This creates a new named code block called `chat-example` that calls the `chat-interface` function and captures its output as the block's result.

You can now reference this block elsewhere in your org-mode document, include it in exports, etc. For example:

```
In this section we're going to use the chat interface to say hello:

#+RESULTS: chat-example
: Hello, World!
```

This would include the output of the `chat-example` block in the exported document.

ChatGPT> please print out the chat so far, formatted in org-mode sytnax
<gpt-end-of-prompt>
Certainly! Here's the conversation so far, formatted in org-mode syntax:

```
* Conversation with elisp programming expert assistant

** You, 2021-11-28 10:15
How do I use org-mode to run a block that contains the output of another buffer? For example, there is a library that uses comint-mode as a chat interface. I want to embed that into an org-mode document, ideally using org-mode blocks. How do I do that?

** Me, 2021-11-28 10:19
You can use the `src` code block in org-mode to execute commands and capture their output. Here's an example:

...

```

Is this what you were looking for?

ChatGPT> yes
<gpt-end-of-prompt>
Great! Let me know if you have any more questions.

ChatGPT> please
<gpt-end-of-prompt>
Sure, how can I help you?

ChatGPT> in emacs lisp, how do you get the contents of a buffer as a string?
<gpt-end-of-prompt>
You can get the contents of a buffer as a string in Emacs Lisp using the `buffer-string` function. You can call this function with the buffer you want to retrieve the contents from as an argument. Here's an example:

```
;; Get the contents of the current buffer
(buffer-string)

;; Get the contents of another buffer by name
(with-current-buffer "some-buffer-name"
  (buffer-string))
```

The first example retrieves the contents of the current buffer, while the second example retrieves the contents of a buffer named "some-buffer-name".

`buffer-string` returns the entire text of the buffer as a string, including any properties and text properties. If you only want the text, you can use the `buffer-substring` function instead. Here is an example:

```
;; Get the text of the current buffer
(buffer-substring (point-min) (point-max))

;; Get the text of another buffer by name
(with-current-buffer "some-buffer-name"
  (buffer-substring (point-min) (point-max)))
```

In this example, `buffer-substring` returns the text of the buffer as a string, excluding any properties and text properties. The arguments `(point-min)` and `(point-max)` specify the region to be returned, which in this case is the entire buffer.

I hope this helps!

ChatGPT> buffer-string is returning text from a buffer that looks like an S-expression. How do I get just the text?
<gpt-end-of-prompt>
If `buffer-string` is returning text that looks like an S-expression, it's possible that the buffer in question has some Lisp code in it. In that case, you can use the `buffer-substring-no-properties` function to retrieve the text content without any properties or S-expression formatting. Here's an example:

```
(with-current-buffer "some-buffer-name"
  (buffer-substring-no-properties (point-min) (point-max)))
```

This will return the text content of the buffer as a string, without any properties or formatting.

You could also use the `subseq` function to extract just the text content from the `buffer-string` output. Here's an example:

```
(subseq (buffer-string) 2 -2)
```

This will return the text content of the buffer without the first and last parentheses of the S-expression.

ChatGPT>

Published Apr 2, 2023

💬 Comments 🔗 Copy Link

Using ChatGPT With Emacs
There are a few packages and libraries that are being built to use ChatGPT along with Emacs.

This page is becoming fairly popular in search results so please let me know if there are other libraries you love or think I should try out.

Recommended

I use the following libraries myself for both writing code and prose (more prose than code these days).
- GPTel focuses on a chat client session within an Emacs buffer and sending text from the buffer (like a selected region or line). Also uses a transient menu (a la magit) for on-the-fly configuration like custom prompts and models to use.
- ChatGPT Shell is an eshell for ChatGPT and dall-e that fascilitats interacting with a chat session. By default it’s prompt is configured for helping with writing code.
Best of the rest

An incomplete list of other libraries I’ve come across.
- Emacs OpenAI has a collection of packages for using ChatGPT and dall-e
- ChatGPT.el was one of the first, but hasn’t been updated to use the official API yet
- ChatGPT Arcana is a more integrated approach including prompt from region
- org-ai combines OpenAI whisper and GPT so you can talk to Emacs (also has a nifty begin_ai block for org-mode)
- leafy-mode adds completions to org-mode using ChatGPT
Querying your own data

Most of the libraries mentioned rely on the user to send text to a chat session. However, they don’t have ways to index your own data to provide context to a chat session or for single-shot completions.

For example, I wrote a LangChain agent for helping me chat with my notes (also named org-ai, oh well!). While it’s one of one software and I have no intention of making it generic, you could adapt it to your own use.

See also:
- ChatGPT lowers barriers to building small projects
Published Apr 1, 2023

💬 Comments 🔗 Copy Link

Large Language Models

Large language models (LLMs) predict the next word in a block of text. At a basic level, you can think of it as a function that takes in text (i.e. a prompt) and returns text (i.e. a completion).

Published Mar 31, 2023

💬 Comments 🔗 Copy Link

Context Is Needed for Practical LLM Applications

One of the limitations of large language models is that it is often missing context when responding. LLMs like ChatGPT (GPT 3.5 and GPT 4 at time of writing) don’t allow fine tuning and can only take in ~4,000 tokens (roughly 3,000 words) as part of the prompt. That’s not nearly enough to provide context to be specific to your application. For example, you might want an AI coding tool to help you add a feature to your codebase, but the codebase is likely much larger than the prompt would allow.

To fit more information into prompts for context, LLMs could benefit from a cheatsheet, generated for the prompt. Combining tools like semantic search with an LLM could allow for better applications that are more specific to the user’s domain. For example, when asking an AI coding tool to add a function, the search part could load in all of the related modules or just the type signatures rather than the entire codebase.

Read Cheating is all you need.

Published Mar 31, 2023

💬 Comments 🔗 Copy Link

Edelman Trust Barometer
A global survey that attempts to measure the extent to which people trust or distrust institutions in their country and in other countries.

Read the latest report.

See also:
- Trust is an important factor of economic prosperity
- Remote work requires trust
Published Mar 30, 2023

💬 Comments 🔗 Copy Link

Businesses Are More Trusted Than Government
According to the Edelman Trust Barometer, people trust businesses more than they trust government, media, and NGOs. In the US, there is a 13 point gap between businesses and government.

See also:
Published Mar 30, 2023

💬 Comments 🔗 Copy Link

42 Percent of Series a Companies Are Fully Remote

A survey of founders by NFX found that 42% of Series A companies are 100% remote which is higher than 38% of seed-stage companies that are fully remote.

Read The NFX Founder Sentiment Report: The Post-SVB Edition.

Published Mar 23, 2023

💬 Comments 🔗 Copy Link

Remote Work Permanence
It’s becoming clear that remote work isn’t going anywhere. A large portion of the workforce continues to work from home. Return to office stagnated. Office real estate value is plummetting.

Employers are shifting from remote work enablement to remote work permanence.

With a permanent part of the workforce always working from home, there are important challenges ahead:
- Effective management of remote teams
- Compliance everywhere all at once
- Accommodating worker preferences
Effective management of remote teams

To be an effective people leader in a world of remote work permanence, managers need to be significantly more effective at building an environment where people can do their best work.

Trust is essential—especially in a remote-first organization. Being overly worried about people not putting in the hours or working two jobs simultaneously is not productive.

Unfortunately, there is reason for employees to feel distrustful. There are massive layoffs in tech stoking fear but don’t improve company performance. Companies are pausing promotions and raises and eliminating employee resource groups. Some are forcing workers to return to the office or be fired. The Overton window of employment is shifting.

What needs to happen?

Managers need to reorient around outcomes not appearing busy. This challenges preconceived notions about management and reveals how reliant we were on seeing people working rather than understanding what jobs entail and the outcomes that are needed to measure performance.

There is no software solution here. My guess is that there will be a massive retraining of managers needed and a change to more asynchronous working styles to fit an increasingly remote workforce. The good news is these changes benefit every employee by making a more equitable and effective working environment.

See also:
- This is an opportunity to win back some of Baumol’s cost disease
- Remote teams need to be more deliberate about everything
- How to write for remote teams
- How to work asynchronously
Compliance everywhere all at once

The rules for employment and tax are different in state, city, and even county. Remote work led to a dramatic increase in the number of locations a business operates in. However, the size of the People, Finance, and Legal teams remained the same.

There is step change in the amount of compliance work and complexity that businesses need to keep up with. In the US, 50 states might as well be 50 countries with all of the rules and changes businesses are expected to comply with.

This is the problem that Mosey is addressing.

See also:
- Compliance is a dynamic problem, it changes as the company grows and as the laws change so it’s never “done”
- One of the trickiest parts is that you don’t know there is a compliance problem until after it’s a problem
- Compliance is a limiting factor of opportunity
Accommodating worker preferences

With the rise of return-to-office policies, it’s clear what management’s preferences are. As I mentioned earlier though, it will never be all or nothing—competing for top talent now necessitates accommodating remote work. Similarly, not every organization or worker should be remote.

That leaves us with the final challenge: worker preferences. Some people prefer working in an office. Some people prefer working remotely. Some will change their mind.

Self-selection might be one way this plays out. People with a preference for remote choose working at a company that is fully remote. People that prefer in-office, will work at a fully collocated organization.

Of course, as a business grows larger they will need to accommodate more preferences. I imagine the ultimate solution for larger organizations is hybrid but rather than individuals splitting their time between home and office, the decision of how the team works will be pushed down to teams and their managers.
Published Mar 22, 2023

💬 Comments 🔗 Copy Link

Turning Cost Centers Into Profit Centers
A successful business playbook I keep coming across is turning cost centers into profit centers. That is, taking a problem that takes money to solve for one business, the solving it and offering it to other businesses that need it.

The best example is AWS. Amazon needed to solve the problem of high quality infrastructure to reliably host their ecommerce store. By making it available to everyone else, they ended up building a business that might end up being bigger than Amazon.

What are the cost centers in your business? If you concluded that you need to build it to properly solve it (build, buy, partner), might there be others interested in buying it?

See also:
- Commoditization increases the importance of distribution
Published Mar 21, 2023

💬 Comments 🔗 Copy Link

The Overton Window of Employment

There is a range of acceptable employer policies that ranges from unthinkable to popular. During the COVID-19 pandemic, previously radical employment policies such as work-anywhere became sensible and then accelerated in popularity.

We can see the window is shifting again today. Employees are being forced to return to the office or be fired. There are mass layoffs at profitable tech companies, reductions in benefits, pausing promotions, and cutting bonuses. We are even beginning to see relaxing child labor restrictions.

Published Mar 20, 2023

💬 Comments 🔗 Copy Link

Startup Growth Calculator
There is a long list of reasons a startup might fail, but running out of money is high among them. A common piece of advice for early startups is to make rapid progress towards “default alive” as in, running the business on revenue rather than relying on outside investment.

Startup Growth Calculator by Trevor Blackwell.

See also:
- Many startups will run out of cash starting end of 2023
- Running a startup during a recession
Published Mar 18, 2023

💬 Comments 🔗 Copy Link

Hawaii Turns Off Their Unemployment Insurance Website Outside of Business Hours

The Hawaii Unemployment Insurance portal becomes unavailable outside of their normal stated business hours. This can be surprising if one’s baseline assumption is that websites should always be up and their functionality doesn’t change whether it’s 9AM on a Monday or 2AM on a Saturday.

The idea of business hours for a website is kind of a good idea if the thing behind the website is actual people doing the job. If it can’t actually do anything for you during non-business hours, then why fake it? Adding queues makes state agencies eventually consistent.

On the other hand, people from a different timezone or odd working hours will find this approach inaccessible. Why should your timezone affect our ability to communicate from my timezone?

Published Mar 18, 2023

💬 Comments 🔗 Copy Link

Product Debt

Product debt is when promises made about functionality exceed what the product can actually do. When this happens there is only two things that can be done: 1) make up for it with manual work or 2) commit to building the functionality. If you stop doing 1) the customer will be disappointed. If you keep adding product debt and never do 2), the product will never keep up and you’ve inadvertently ended up building a services business.

The ideal amount of product debt, like tech debt, is greater than zero. It can be helpful to over promise when you doing customer development and figuring out what is going to sell. However, keeping in mind product debt (specifically that it must be paid down) means knowing a technical solution is possible and what’s promised stays within scope of what the technology and team are capable of.

One of the ways product debt increases without you realizing is in support and operations. As customers come across challenges, it is in the best interest of the team to help. Through the course of helping they may do things that the product is not capable of, working around an edge case here, handling something on behalf of the customer, etc. For this reason, product and engineering should spend time with support and operations to see how they are making up for shortcomings in the product—you might not be happy with the product debt you are incurring without knowing it.

Published Mar 16, 2023

💬 Comments 🔗 Copy Link

Answer Required, Not Necessarily a Good One
There are many times in business when an answer is required. A customer asking you a question about your product. An investor asking you about the market. A sales lead asking you about competitors.

A good answer is seldom necessary. It’s enough that you listened and answered the question. They may not be happy with your answer but satisfied to move on.

This is an important lesson because it can feel like a catastrophe when you’re asked a tough question or something you haven’t prepared for. Just keep in mind that in a very large percentage of these situations, people will be okay with the fact that you merely answered their question.

See also:
- This is why bullshitters are so effective and so problematic
- Rather than evaluating an answer with “does this sound reasonable?”, evaluate it with “is this a good explanation?”
Published Mar 15, 2023

💬 Comments 🔗 Copy Link

Effective management of remote teams

Compliance everywhere all at once

Accommodating worker preferences