Hi, I'm CY I study philosophy, democracy, and LLMs.

My research focuses on the epistemology of LLMs, normative interpretability of models, and applying various normative concepts to models.

Essays

all →
Fri, February 6, 2026 · 24 min read

Alignment and Large Language Models

#AI#Alignment

Field Notes

all →
CY
CY @polis_notebook · Thu, February 26, 2026 · Long Form

If we want to finish a story, the life that the story comes from must have more content than the story itself can contain.


中文翻译

如果我们想完成一个故事,那么故事所源自的生活必须拥有比故事本身所能容纳的更多内容。

writing life storytelling
CY
CY @polis_notebook · Wed, February 25, 2026 · Long Form

This is very social epistemology, essentially about testimony. Williams believes wishful thinking is a lie to ourselves. In the activity of deception, we not only blame the deceiver but also stress the importance of caution. So improving vigilance in Williams’s perspective matters equally. Self-deception in this sense is a failure because we cannot maintain vigilance on the formation of the beliefs we wish to believe are true. It is an activity that allows us to believe what we wish is true. Accuracy reappears at this point. It is a capacity to monitor our own judgment and to understand what our epistemic limits are.


中文翻译

这是非常社会认识论的内容,本质上关乎证言。威廉姆斯认为,一厢情愿的思维是我们对自己撒的谎。在欺骗活动中,我们不仅谴责欺骗者,也强调谨慎的重要性。因此,在威廉姆斯看来,提高警觉性同样重要。就此意义而言,自我欺骗是一种失败,因为我们无法对希望为真的信念形成过程保持警觉。它是一种让我们相信自己所希望之事为真的活动。准确性在此重新出现。它是一种监控自身判断并理解我们认识论局限的能力。

epistemology testimony self-deception vigilance
CY
CY @polis_notebook · Wed, February 25, 2026 · Long Form

It is very difficult to ask us to monitor our own thinking. It requires an advanced level of self-reflection. It requires us to have a research taste for what methods of inquiry are reliable, e.g., discussion and experiment are different from brainwashing or random guessing. If we believe these research methods are instrumental to truth, then we can distinguish what methods of inquiry are reliable.


中文翻译

让我们监控自己的思维是非常困难的。这需要高级别的自我反思。我们需要对哪些探究方法是可靠的进行研究测试,例如,讨论和实验与洗脑或随机猜测是不同的。如果我们相信这些研究方法是通向真理的工具,那么我们就能区分哪些探究方法是可靠的。

metacognition inquiry truth bias

Stay updated

Occasional essays on LLM epistemology, alignment, and political philosophy. No spam.