• How to Decide if AI Tools Can Be Used at Work

    Advancements in AI-powered tools can greatly improve productivity, but many companies have taken steps to limit or outright ban the use of OpenAI’s ChatGPT, GitHub Copilot, and others. What are they concerned about, and how should you decide whether these tools can be used at your company?

    Risks of AI tools in the workplace

    Because large language models are very big and resource intensive (though this is changing), they need to be run on servers rather than on-device. Since these models work on text, that means transmitting a lot of potentially sensitive information over the network. To my knowledge, none of the major AI platforms offer end-to-end encryption.

    There are also privacy and IP concerns. If information sent off for processing is mishandled, it could leak important IP or trade secrets. Apple recently banned ChatGPT and I suspect that is the reason. I’m guessing there are also legal concerns about ownership if AI-generated output ends up in a company’s IP.

    How to decide

    The value of AI tools in the workplace is productivity. If GitHub Copilot can improve developer productivity even a small amount, it would be well worth the cost given how expensive engineering time is. On the other hand, there are real risks.

    These risks can be managed by thoughtful policies, training, and controls.

    1. Do not allow sensitive customer data to be sent to AI tools over the network. Mitigations might include deciding which teams can use e.g. ChatGPT, creating training on how to use AI tools, or building an in-house wrapper around LLMs that can detect certain data like credit card numbers and IDs (see the sketch after this list).
    2. Avoid using coding tools that require access to the entire codebase. Mitigations might include only allowing local language models, ensuring all secrets and API keys are encrypted or not stored in version control, or banning Copilot but allowing engineers to use ChatGPT for code help.
    3. Buy the enterprise version of these tools and ban personal-use tools. Many providers realize that stronger guarantees about data use and storage are necessary for businesses. Coupled with banning any personal AI tool usage, this could provide more privacy and security.
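
    To make the in-house wrapper idea in item 1 concrete, here is a minimal sketch of a pre-filter that blocks prompts containing obvious card numbers before they are sent to an external LLM API. The pattern and function name are illustrative, not from a real product.

    import re

    # Naive check for card-number-like sequences (13-16 digits with optional
    # spaces or dashes). A real deployment would also look for IDs, emails, etc.
    CARD_PATTERN = re.compile(r"\b(?:\d[ -]?){13,16}\b")

    def guard_prompt(prompt: str) -> str:
        """Refuse to forward a prompt that appears to contain sensitive data."""
        if CARD_PATTERN.search(prompt):
            raise ValueError("prompt appears to contain a card number; refusing to send")
        return prompt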

    This was just a quick list of ideas, but it seems like more nuanced approaches can be taken to balance the risk and reward of using these tools. One big thing is missing though—what new failure modes and risks now exist as a result of using these tools? I guess we’ll find out soon enough.


    Published

  • Build a Cross Platform FAISS Index

    A FAISS similarity search index is not cross-platform. If you save the index locally on aarch64 and try to load it in an x86_64 environment, it will not work (for me, it loads an empty index). Using Docker, you can save an index built locally so it’s compatible with other architectures.

    To do that, run a container of the image which has your setup using the --platform flag (matching the architecture of the servers the index will be loaded on) and run your code to build and save the index (using the FAISS save_local method). The index files generated in the container will get synced to your local machine via the volume mount so you can use them however you need (e.g. check them into version control, store them in S3).

    docker run --rm -it \
           --platform linux/x86_64 \
           -v $PWD:/my/container/path myimage:latest \
           sh
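
    Inside that shell, something like the following builds and saves the index. This is a minimal sketch assuming the LangChain FAISS wrapper and OpenAI embeddings; swap in whatever embedding model and documents you actually use (the path matches the volume mount above).

    from langchain.embeddings import OpenAIEmbeddings
    from langchain.vectorstores import FAISS

    # Build the index inside the container so it matches the target architecture.
    docs = ["first document", "second document"]  # your documents go here
    index = FAISS.from_texts(docs, OpenAIEmbeddings())

    # Written under the volume mount, so the files appear on the host machine.
    index.save_local("/my/container/path/faiss_index")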
    

    Published

  • Productivity Is Bounded by Decision Making

    At a certain point, optimizing productivity becomes optimizing for speed of decision making. After all the tools, shortcuts, and hacks that build up raw speed to get tasks done, you’re left with the cognitive load of decision making. That email you received? It’s a decision disguised as a reply. That Slack message that remains unread? You’re procrastinating because a decision needs to be made that you don’t want to confront.

    So many productivity frameworks and tools rely on cleverly hiding tasks in queues so that you can do them later. Try being more decisive and see what kind of impact that has on your to-do list.



    Published

  • Clarity Is One Number

    Making complicated things seem simple involves abstracting over reality in a way that is clear and actionable. Oftentimes, that means reducing things down to a single number. People are drawn to (fixated on, even) the clarity of one number going up or down.

    For example, your weight captures a high degree of nuance at low fidelity—it could go up or down for a myriad of reasons—but provides clarity in a way that tracking dozens of biometrics does not. If it starts to go up, you might look at it with concern; if it goes down, you might celebrate it as a victory.

    We see this desire for one number everywhere. A stock price grossly encapsulates a company’s value and the market’s psychology. The score in a baseball game indicates who is winning and who is losing. The Earth’s rising average temperature indicates catastrophic climate change.



    Published

  • Bobby Bonilla Deal

    The New York Mets made one of the worst deals in sports history. From 2011 to 2035, the Mets have paid and will continue to pay Bobby Bonilla, a baseball player who has long since retired, $1.19MM every year.

    A “Bobby Bonilla deal” is one that results in substantial compensation being paid with no value being generated at all for a long time. You can tell it’s a Bobby Bonilla deal if you ask the question, “What value will the other party bring 10 years from now?” and the answer is, “Nothing but they will still get paid handsomely.”

    This might sound a lot like royalties, but I assure you it is not! With royalties, there is a tangible, renewable asset that has value and can be traded. Its value could decline, but so would usage and therefore royalty payments. On the other hand, a professional baseball player is a depreciating asset whose value, by definition, goes to zero.


    Published

  • Bluesky Will Never Be the Cozy Web

    Bluesky is having a moment where all the users are new, the content is shitpost-y, and it all feels very lively. They’re also getting their first taste of unsavory people joining, ruining the collective bubble.

    The cozy web is incompatible with large-scale social networks because the cozy web needs to be small and social networks need to be large. As a result, there are no controls that could be put in place to simultaneously build a large public social network and make it free from bad people.

    It’s unfortunate because Bluesky is (currently) really fun and whimsical, but I can’t help feeling we’re watching another social network speed run through everything learned about social media and content moderation over the last decade.


    Published

  • AI Multiplies the Value of Expertise

    AI reduces the cost of certain tasks to effectively zero. In doing so, it lowers the barriers to domains that previously took years of skill-building, such as writing code, data analysis, and more. This is precisely why AI also increases the value of expertise and experience.

    It’s one thing to write code, but as any engineer will tell you, there’s more to it than that. It might be easy to write the function, but large language models can’t reason about whether or not it’s the right thing to do. As Byrne Hobart writes in Working with a Copilot, “With a lower cost to making bad decisions fast, there’s a higher value on making good decisions.” Domain expertise and context are in much higher demand when the cost of low-context work goes to zero.

    The same thing exists in business contexts. With AI tools that can summarize content from anywhere all the time, providing the right context (and specifying the right tasks) becomes a multiplier on an expert’s time. Used effectively, it’s as if they’ve just added a team of low-level workers at their beck and call.



    Published

  • Save D3 to Svg

    The easiest way to export a d3 visualization is to use the svg-crowbar bookmarklet from the New York Times. It will save the d3 canvas to an SVG and download it (if there are multiple SVGs on the page, you can click which one you want to download).


    Published

  • Universality Leads to NP-Complete Problems

    There is a surprising link between universality and NP-complete problems in computer science.

    Important context: finding an efficient algorithm to solve one NP-complete problem would mean solving all problems in the class.

    Computation can be thought of as the solution to the complete problem of executing computable functions. Any computable function can be run on the same machine given the right sequence of instructions and inputs (e.g. you don’t need to buy a special machine to run Slack).

    When we talk about NP-complete problems, we typically talk about specific classes that have resource constraints. For instance, there are many classes of complete problems with respect to time—they are computable but would take many years to complete (e.g. breaking 256-bit encryption would take millions of years).

    Universality explains why NP-complete problems exist: they are a subset of what’s solvable by a universal computation machine. We know this because resource constraints can be simulated on a universal computing machine by measuring resources and stopping execution if the constraint is exceeded.
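
    As a toy illustration of that last point, here is a minimal sketch (the names are hypothetical) of simulating a resource constraint by counting steps and halting once a budget is exceeded.

    def bounded_run(step_fn, state, halted, budget):
        """Run a step function until it halts or the step budget is exhausted."""
        for steps in range(budget):
            if halted(state):
                return state, steps
            state = step_fn(state)
        raise TimeoutError("step budget exceeded")

    # Toy example: count down from 10 with a budget of 100 steps.
    final_state, steps_used = bounded_run(lambda n: n - 1, 10, lambda n: n == 0, 100)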

    Memory usage, CPU usage, and time are constraints that matter for making solutions practical, not for making them solvable in the first place. In fact, it’s because they are solvable (due to universality) that NP-complete problems exist at all.

    Read Why are there complete problems, really?


    Published

  • Why Vector Databases Are Important to AI

    The recent success of large language models like ChatGPT has led to a new stage of applied AI and, with it, new challenges. One of those challenges is building context within a limited amount of space to get good results.

    For example, let’s say you want the AI to respond with content from your data set. To do that, you could stuff all of your data into the prompt and then ask the model to respond using it. However, it’s unlikely the data would fit neatly into ~3,000 words (the input token limitation of GPT-3.5). Rather than try to train your own model (expensive), you need a way to retrieve only the relevant content to pass to the model in a prompt.

    This is where vector databases come in. You can use a vector DB like Chroma, Weaviate, Pinecone, and many more to create an index of embeddings and perform similarity searches on documents to determine what context to pass to a model for the best results.
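
    A minimal sketch of that retrieval step, assuming Chroma with its default embedding model (the collection name and documents are made up):

    import chromadb

    client = chromadb.Client()
    collection = client.create_collection("notes")

    # Index documents as embeddings (Chroma embeds them with its default model).
    collection.add(
        documents=["Notes on vector databases", "Notes on baseball contracts"],
        ids=["note-1", "note-2"],
    )

    # Retrieve the most similar document to stuff into the prompt as context.
    results = collection.query(query_texts=["How do vector databases help LLMs?"], n_results=1)
    context = results["documents"][0]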


    Published

  • Automation Reduces Marginal Cost of Nonautomated Tasks

    In a recent study, researchers looked at the effects of automation in a supermarket. They found that by automating the process of collecting payment, productivity on the non-automated task of scanning items increased by 10%. An explanation for the improvement is that automation enabled specialization, and specialization reduces the marginal cost of the other tasks, which increases effort and therefore productivity.

    This makes intuitive sense—being able to offload tasks frees up time to focus on the other ones.

    What I find even more interesting is the inverse implication—manual tasks increase the marginal cost of all other non-automated tasks. In other words, time and costs are higher for every manual task introduced into a workflow.

    Read Automation Enables Specialization: Field Evidence.



    Published

  • How Langchain Works

    As it turns out, combining large language models can create powerful AI agents that respond to and take action on complicated prompts. This is achieved by composing models and tools, with an overall language model mediating the interaction.

    For example, a langchain agent uses the following prompts to string together multiple “tools”, which alter how it responds based on the user’s input:

    Assistant is a large language model trained by OpenAI.
    
    Assistant is designed to be able to assist with a wide range of tasks, from answering simple questions to providing in-depth explanations and discussions on a wide range of topics. As a language model, Assistant is able to generate human-like text based on the input it receives, allowing it to engage in natural-sounding conversations and provide responses that are coherent and relevant to the topic at hand.
    
    Assistant is constantly learning and improving, and its capabilities are constantly evolving. It is able to process and understand large amounts of text, and can use this knowledge to provide accurate and informative responses to a wide range of questions. Additionally, Assistant is able to generate its own text based on the input it receives, allowing it to engage in discussions and provide explanations and descriptions on a wide range of topics.
    
    Overall, Assistant is a powerful system that can help with a wide range of tasks and provide valuable insights and information on a wide range of topics. Whether you need help with a specific question or just want to have a conversation about a particular topic, Assistant is here to assist.
    

    Part 2:

    TOOLS
    ------
    Assistant can ask the user to use tools to look up information that may be helpful in answering the users original question. The tools the human can use are:
    
    > Current Search: Useful for when you need to answer questions about current events or the current state of the world. The input to this should be a single search term.
    > Find Notes: Useful for when you need to respond to a question about my notes or something I've written about before. The input to this should be a question or a phrase. If the input is a filename, only return content for the note that matches the filename.
    
    RESPONSE FORMAT INSTRUCTIONS
    ----------------------------
    
    When responding to me, please output a response in one of two formats:
    
    **Option 1:**
    Use this if you want the human to use a tool.
    Markdown code snippet formatted in the following schema:
    
    ```json
    {{
        "action": string \ The action to take. Must be one of Current Search, Find Notes
        "action_input": string \ The input to the action
    }}
    ```
    
    **Option #2:**
    Use this if you want to respond directly to the human. Markdown code snippet formatted in the following schema:
    
    ```json
    {{
        "action": "Final Answer",
        "action_input": string \ You should put what you want to return to use here
    }}
    ```
    
    USER'S INPUT
    --------------------
    Here is the user's input (remember to respond with a markdown code snippet of a json blob with a single action, and NOTHING else):
    
    {input}
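
    For reference, here is a minimal sketch of wiring up an agent like this with the (2023-era) LangChain API; the tool functions below are hypothetical stand-ins for the Current Search and Find Notes tools described in the prompt.

    from langchain.agents import AgentType, Tool, initialize_agent
    from langchain.chat_models import ChatOpenAI
    from langchain.memory import ConversationBufferMemory

    # Hypothetical implementations of the two tools from the prompt above.
    def current_search(query: str) -> str:
        return "search results for: " + query

    def find_notes(query: str) -> str:
        return "notes matching: " + query

    tools = [
        Tool(name="Current Search", func=current_search, description="Look up current events."),
        Tool(name="Find Notes", func=find_notes, description="Search my notes."),
    ]

    # This agent type uses prompts along the lines shown above to mediate between tools.
    agent = initialize_agent(
        tools,
        ChatOpenAI(temperature=0),
        agent=AgentType.CHAT_CONVERSATIONAL_REACT_DESCRIPTION,
        memory=ConversationBufferMemory(memory_key="chat_history", return_messages=True),
        verbose=True,
    )
    agent.run("What have I written about vector databases?")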
    

    Published

  • A Curiosity Loop Contextualizes Advice

    Sometimes problems you encounter need an outside perspective to help you figure out what to do. A curiosity loop helps contextualize advice from multiple people in a way that makes it far more useful than getting overly generalized advice from one source.

    How to do it

    • Curate who you ask: balance the loop by consulting experts but also someone who knows you really well (experts might not have specific answers for you the way a close friend would)
    • Ask good questions that are specific (with context), solicit rationale, and avoid biases (like a good survey question)
    • Make it lightweight and easy to respond to: lower the cognitive load by giving a list of choices and asking why; let people pick and choose with a quick answer or make space to send something longer
    • Try to get 3 or 4 responses (you might need to email a larger number of people based on the response rate)
    • Process the information and thank them: it feels good to help someone and be heard, sending a genuine thank you can be rewarding in and of itself

    From Ada Chen Rekhi via Lenny’s Podcast.



    Published

  • AI Is the Next Great Interop Layer

    I had previously observed that humans are the great interop layer—we are the glue that fits disparate processes and tools together into usable systems. After using large language models, I’m becoming convinced that they can offload a large amount of the interop cost that currently falls to us. In a nutshell, AI can ‘do what I mean, not what I say’ pretty darn well.

    The latest LLM tools can interpret what we mean far better than before. Natural language interfaces open up the possibility of sloppy interfaces—ones where not everything needs to be specified precisely. This allows for tolerance in the system, making it easier for more things to fit together.

    By contrast, APIs are largely a ‘do what I say’ interface. They require precise steps to complete an action. They require documentation to help a human understand how to use them, and it takes creativity to apply them in ways that solve the problem at hand. Now we have LLMs that can figure out how to make API calls (Zapier NLA API) and map different actions to different methods (langchain tools).


    Published

  • Layering vs Chunking

    When building large complicated things, there are two primary strategies that optimize for different things—layering and chunking.

    Chunking is about taking something large, breaking it up into smaller pieces, and assembling them into one cohesive thing. This is pretty common and intuitive, pairs nicely with “divide and conquer”, and can speed things up, but it tends to be all or nothing (either all the chunks are delivered or nothing ships).

    Layering is also about breaking something large down but in a way that accumulates. The overall project is divided into smaller layers that are independently useful and shipped. This makes it more iterative, delivering value along the way and providing opportunities to adapt to new information.

    Layering is much more desirable for large complicated projects because it’s incremental.

    Problems arise when teams confuse chunking for layering—the team might think they are being fast and incremental, but deliver no value to the user until the end anyway.

    I’ve observed many large-scale projects fail because they used the wrong strategy or because no one recognized that chunking was the only option.

    To help spot these issues, a simple clarifying question to ask is: “At what point can a user actually use this?” (This can also be adapted for internal infrastructure projects: “At what point will this deliver value?”).

    (I read about this somewhere and apply it all the time but I can’t find the source, sorry!)


    Published

  • Prompt Injection Attacks Are Unavoidable

    While large language models are already useful for certain text-based tasks, connecting them to other systems that can interact with the outside world poses new kinds of security challenges. Because it’s all based on natural language, any text can effectively become untrusted code.

    Some examples:

    • Adding a prompt injection attack to your public website that can be picked up by an LLM-enabled tool like Bing search
    • An LLM-enabled personal assistant that can read your email might be prompt-injected simply by sending it an email
    • Data could be exfiltrated from a support ticketing system by sending a prompt-injected message
    • Training data might be poisoned by including prompt injection text via hidden text

    It’s unclear what the solutions to these problems are. You could chain together more AI tools to detect prompt injection attacks. You could build protections into the prompt used internally. You could warn the user or log a message for every action taken and use anomaly detection.
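
    For illustration, here is a minimal (and easily evaded) sketch of the detection idea: scanning untrusted text for known injection phrases before it reaches the model. The phrases and function name are made up.

    import re

    # Known-bad phrases; real attacks will paraphrase around a list like this,
    # which is why none of these mitigations are complete.
    SUSPECT = re.compile(r"ignore (all|previous) instructions|disregard the above", re.I)

    def looks_like_injection(text: str) -> bool:
        """Flag untrusted text (emails, web pages, tickets) before it reaches the LLM."""
        return bool(SUSPECT.search(text))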

    Read Prompt injection: what’s the worst that can happen?


    Published

  • Natural Language User Interface

    One of the superpowers of large language models is that they can “do what I mean” instead of “do what I say”. This ability to interpret prompts can drastically lower the barriers to accessing and interoperating between systems. For example, writing “Send a Slack message to the Ops channel with a list of customers from HubSpot that signed up in the last week” would generate actions that query the HubSpot Contacts API, parse and extract the results, and make another API request to Slack to post to the #ops channel.

    The best example I can find of this is from Zapier’s Natural Language Actions (NLA) API. Zapier is perfectly positioned to enable a jump to universality for natural language UI because it already connects thousands of API providers together.
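
    As a rough sketch of what that wiring could look like, here is the (2023-era) LangChain integration for Zapier NLA; it assumes a ZAPIER_NLA_API_KEY and OPENAI_API_KEY in the environment and that the relevant HubSpot and Slack actions are enabled in Zapier.

    from langchain.agents import AgentType, initialize_agent
    from langchain.agents.agent_toolkits import ZapierToolkit
    from langchain.llms import OpenAI
    from langchain.utilities.zapier import ZapierNLAWrapper

    # Each Zapier NLA action you enable becomes a tool the agent can call.
    zapier = ZapierNLAWrapper()  # reads ZAPIER_NLA_API_KEY from the environment
    toolkit = ZapierToolkit.from_zapier_nla_wrapper(zapier)

    agent = initialize_agent(
        toolkit.get_tools(),
        OpenAI(temperature=0),
        agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION,
        verbose=True,
    )
    agent.run("Send a Slack message to the Ops channel with a list of customers "
              "from HubSpot that signed up in the last week")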

    NLUI can also be useful as a frontend to a single service. For example, making a chat support bot that can access the content of support docs.


    Published