One thing I do look back on fondly was how incredibly focused we were. Resources and time were so tight that you could feel the weight of all the things you weren’t working on. You had real conviction that the thing you were doing was the most important thing.
To pick a somewhat trivial example, at fireside chats with Mark (the predecessor to the... See more
Unsurprisingly, hiring friends and former colleagues was by far the biggest channel. This also in part explains why multi-time founders, and anyone with a large network (e.g. Y Combinator), have an advantage:
“All of our early hires were friends/ex-coworkers.”
“First hires were practically all former colleagues. Several people who worked with me in
Every smart person I know is a voracious reader who also says “every smart person I know is a voracious reader.” There are so few exceptions to this rule it’s astounding. College tuition at $25,000 a year comes out to roughly $100 per lecture. Good books – sometimes written by the same professor – can be purchased for fifteen bucks and can offer... See more
Let’s first look at how to calculate self-attention using vectors, then proceed to look at how it’s actually implemented – using matrices.
The first step in calculating self-attention is to create three vectors from each of the encoder’s input vectors (in this case, the embedding of each word). So for each word, we create a... See more
“My whole life has been spent trying to teach people that intense concentration for hour after hour can bring out in people resources they didn’t know they had.”
— Edwin Land