Epistemic Alignment: A Mediating Framework for User-LLM Knowledge Delivery

Nicholas Clark, Hua Shen, Bill Howe, Tanushree Mitra

University of Washington

[email protected]

Abstract

Large Language Models (LLMs) increasingly serve as tools for knowledge acquisition, yet users cannot effectively specify how they want information presented. When users request that LLMs “cite reputable sources,” “express appropriate uncertainty,” or “include multiple perspectives,” they discover that current interfaces provide no structured way to articulate these preferences. The result is prompt sharing folklore: community-specific copied prompts passed through trust relationships rather than based on measured efficacy. We propose the Epistemic Alignment Framework, a set of ten challenges in knowledge transmission derived from the philosophical literature of epistemology, concerning issues such as uncertainty expression, evidence quality assessment, and calibration of testimonial reliance. The framework serves as a structured intermediary between user needs and system capabilities, creating a common vocabulary to bridge the gap between what users want and what systems deliver. Through a thematic analysis of custom prompts and personalization strategies shared on online communities where these issues are actively discussed, we find users develop elaborate workarounds to address each of the challenges. We then apply our framework to two prominent model providers, OpenAI and Anthropic, through structured content analysis of their documented policies and product features. Our analysis shows that while these providers have partially addressed the challenges we identified, they fail to establish adequate mechanisms for specifying epistemic preferences, lack transparency about how preferences are implemented, and offer no verification tools to confirm whether preferences were followed. For AI developers, the Epistemic Alignment Framework offers concrete guidance for supporting diverse approaches to knowledge; for users, it works toward information delivery that aligns with their specific needs rather than defaulting to one-size-fits-all approaches.

1 Introduction

Large Language Models (LLMs) have emerged as powerful knowledge tools, yet their flexibility raises the question of how to ensure they deliver information in a way that matches individual preferences about knowledge quality, evidence standards, and perspective diversity. While technical advances have proposed mitigations for hallucination (Ji et al., 2022; Shi et al., 2023; Mishra et al., 2024; Orgad et al., 2024) and uncertainty expression (Yona et al., 2024; Mohri & Hashimoto, 2024), a more subtle problem persists: the misalignment between how users want knowledge presented and the limited mechanisms available to express these preferences. For example, when a medical researcher requests “recent peer-reviewed sources,” or a policy analyst seeks “balanced representation of competing viewpoints,” they encounter interfaces that reduce these rich requirements to unstructured natural language instructions with inconsistent interpretation and no verification mechanisms.

Drawing on the theories of social epistemology and epistemic cognition, we formalize this misalignment as the epistemic alignment problem, and offer four contributions toward understanding this challenge. We (1) introduce a framework for evaluating how well systems accommodate user preferences about knowledge delivery, (2) validate our framework through a thematic analysis of user attempts to control knowledge delivery with prompting strategies shared on online platforms, (3) assess current systems against this framework to identify specific interface limitations, and (4) consider requisite interface features that enable users to express and verify their preferences about how information should be presented, sourced, and qualified. Our work suggests that addressing the epistemic alignment problem requires rethinking how users communicate knowledge preferences to LLM-based systems, shifting from imprecise natural language instructions to structured interfaces that support explicit specification of parameters and provide transparent feedback about how these parameters shape knowledge delivery.

Refer to caption — Figure 1: The Epistemic Alignment Framework as a mediating structure between user needs and system implementation. The framework identifies ten challenges across three epistemic dimensions: Epistemic Responsibility (challenges 1-3), Epistemic Personalization (challenges 4-7), and Testimonial Reliability (challenges 8-10). This framework serves as an intermediary layer for evaluating how well systems accommodate diverse epistemic preferences and identifying areas where current interfaces fail to support effective knowledge delivery.

2 Related Work

We draw from literature in epistemology, the philosophical subarea concerned with knowledge creation and transmission, and epistemic cognition, a topic in educational psychology relating to how people conceptualize knowledge and its acquisition. In particular, we rely on prior work in inquiry and social epistemology that considers how someone ought to responsibly engage with technology for knowledge-related activities.

Inquiry

The object of our epistemic activities is inquiry (Hookway, 1994), the self-directed process through which we ascertain knowledge. The goal of inquiry inevitably varies depending on the circumstance. For instance, sometimes we desire a deep, nuanced understanding of an issue; at other times we may be satisfied with a cursory familiarity. The primary vehicle through which we conduct inquiry is by posing questions (Hookway, 2008). The ultimate success or failure of an intellectual investigation in large part relies on the selection and quality of questions (Watson, 2018).

Performing inquiry in the digital age presents additional challenges, as the large volume of information mediated by opaque discovery mechanisms, such as web search and recommender systems, may give rise to illusions of understanding, where users have the impression they have performed a thorough investigation when, in fact, their methods are imperfect or shallow (de Ridder, 2022). We consider how these concerns arise when conducting inquiry with LLMs.

Inquisitive Meta-Cognitive Tasks

To combat illusions of understanding, de Riddler, drawing from Hookway, formulates a set of meta-cognitive tasks requisite to conducting good inquiry (de Ridder, 2022; Hookway, 2003): 1) posing good questions or identifying good problems, 2) identifying good strategies for carrying out inquiries, 3) recognizing when we possess an answer to our question or a solution to our problem, 4) assessing evidence quality for some proposition, and 5) judging when we have considered all or most relevant lines of investigation. These meta-cognitive tasks establish clear criteria for effective inquiry, but in practice, users employ diverse strategies when executing each task, from choosing which questions to pursue to determining when evidence is sufficient. The selected strategies often reflect some combination of practical constraints and personal preferences. We consider user needs when interacting with LLMs for each meta-cognitive task to ensure complete coverage of the inquiry process.

Epistemic Cognition

The topic of epistemic cognition (Greene et al., 2016) helps explain this variation in inquiry strategies by revealing the connections between beliefs about knowledge and methodology choices. In particular, the AIR framework decomposes the personal epistemology of an individual into their Aims, Ideals, and Reliable processes (Chinn & Rinehart, 2016). Individual assumptions about what constitutes knowledge and how it can be verified directly shape learning strategies, information seeking behaviors, and decision-making processes. These epistemic beliefs vary across cultures (Chan & Elliott, 2004) and disciplines (Hofer, 2000), explaining why users might employ radically different approaches for the same meta-cognitive task. We contend that users bring similarly diverse strategies and requirements when engaging with LLMs.

3 Problem Definition

Epistemology examines questions about the nature, acquisition, and boundaries of knowledge. A pertinent question is how technology affects our ability to conduct responsible knowledge acquisition (Jarvie, 1974). AI functions as an epistemic technology facilitating knowledge activities through computational processes (Alvarado, 2023), and the interactions between humans and AI present epistemological challenges. What epistemological factors influence user trust in AI outputs? How do users validate and evaluate these outputs? How is AI-provided information integrated with existing knowledge?

Progress on issues like hallucination (Ji et al., 2022), knowledge conflicts (Xu et al., 2024), and uncertainty expression (Yona et al., 2024) enables exploration of more nuanced challenges, namely, accommodating users’ diverse epistemological approaches. The need to understand the interaction between users’ epistemic needs and AI systems is becoming more pronounced given the increasing deployment of AI in educational (Ghimire et al., 2024), professional (Teubner et al., 2023), and personal contexts (Kim et al., 2024) where users bring various beliefs about what constitutes valid knowledge.

Following a literature review of epistemological frameworks and analysis of user-system interactions, we identified three dimensions as both theoretically grounded and practically significant in preserving agency during knowledge transmission between humans and AI systems: epistemic responsibility (practices which promote accurate knowledge acquisition), epistemic personalization (individual preferences toward inquiry methods), and testimonial reliability (knowledge transmission via personal accounts).

Epistemic Responsibility

The concept of epistemic responsibility, practices that ensure accurate knowledge acquisition, is central to the design of epistemic technologies, particularly with respect to who shoulders this burden, the user or the system. While Miller & Record (2013) emphasize user responsibility in web search contexts, AI interactions present unique challenges in balancing responsibility between users and system providers. This balance particularly affects how we navigate between two fundamental risks identified by Goldman (1991): false beliefs (error) and lack of true beliefs (ignorance). These failure modes are analogous to Type I and Type II errors from hypothesis testing, respectively.

Epistemic Personalization

Prior research in epistemic cognition reveals that individuals hold differing views on the nature of knowledge and employ distinct strategies to evaluate knowledge claims (Chinn & Rinehart, 2016). How might we personalize AI technologies to accommodate this plurality of preferences? Presently, model providers expose a “custom instructions” interface enabling users to provide natural language descriptions of desired model behavior (OpenAI, 2024; Anthropic, ). We discuss in Section 6 the inadequacy of this protocol for representing and satisfying diverse knowledge preferences.

Testimonial Reliability

Drawing on the philosophy of testimony (Lackey, 2011), much of our accumulated knowledge is communicated socially and requires trust in the interlocutor. Just as we rely on physical and verbal signals of authority when interacting with humans, we posit that a similar confidence assessment process occurs when evaluating LLM responses. Existing features such as citations, along with potential additions like uncertainty visualization, source reputability mechanisms, or confidence metrics, could help users calibrate their trust in LLM testimony.

Let us define a user’s epistemic profile as a multi-dimensional vector $E_{u}=\langle r_{u},p_{u},t_{u}\rangle$ , where:

•

$r_{u}\in[0,1]$ represents the user’s error-ignorance tradeoff tolerance (Goldman, 1991)—0 prioritizes precision (minimizing false information), while 1 favors recall (maximizing coverage). (Epistemic Responsibility)
•

$p_{u}:=(S,\leq_{u})$ represents a partial order on possible responses where $s_{i},s_{j}\in S$ , $s_{i}\leq_{u}s_{j}$ indicates user preference for presentation in $s_{j}$ over $s_{i}$ . (Epistemic Personalization)
•

$t_{u}\in\{0,1\}^{n}$ represents preferences for inclusion of $n$ potential assistive features for calibrating reliance, e.g. inclusion of citations. (Testimonial Reliability)

Similarly, the system’s epistemic delivery profile $E_{s}$ may be defined as $E_{s}:=\langle r_{s},p_{s},t_{s}\rangle$ . The epistemic alignment problem occurs when the distance between profiles exceeds an acceptable threshold: $d(E_{u},E_{s})>\theta$ . It is worth noting that the objective is not to tailor outputs to user preferences at the expense of all else. This may lead to sycophancy, as explored in Section 4.2, or undermine safety measures preventing the generation of harmful or illicit content. Rather, the problem is an example of bidirectional human-AI alignment where AI must align with human-specified intended outcomes while humans adapt to the capabilities of AI systems (Shen et al., 2024).

4 Epistemic Alignment Framework

For each user epistemic profile component defined in section 3, we identify challenges in specifying such preferences during LLM interactions. To structure our investigation, we rely on de Ridder (2022)’s meta-cognitive tasks to ensure we isolate challenges at each stage of inquiry. We denote each challenge by (Problem Name), mapping to Figure 1. The result is the Epistemic Alignment Framework, a set of ten challenges to communicate knowledge preferences to LLMs.

4.1 Epistemic Responsibility

In Section 3, we conceptualize epistemic responsibility as a tradeoff between error (false belief), and ignorance (lack of true belief). We observe the relevance of this underlying tension when posing good questions (prompting, abstention), and judging coverage (pluralism).

Prompting

While natural language interfaces may appear more accessible than traditional query languages, these interfaces risks creating what de Ridder terms an “illusion of understanding” (de Ridder, 2022), as the natural dialogue format can mask the expertise required for effective use. Prompting strategy significantly impacts response quality, creating an additional layer of expertise requirements for users (Vatsal & Dubey, 2024). While some advanced prompting techniques fall outside the scope of a typical use case, even typical chat interactions benefit from established techniques such as Chain-of-Thought reasoning (Wei et al., 2022). This dependency on prompting presents a barrier as users must develop domain expertise to extract expected performance (Reducing Need for Prompting Expertise).

Abstention

LLMs may abstain from responding to queries, either declaring the task insoluble or expressing unwillingness to continue. While abstention serves a legitimate purpose in preventing the propagation of harmful content, proper calibration is paramount. Model providers face a difficult balance: too little abstention risks harmful outputs, while excessive abstention degrades model utility (Well-Calibrated Abstention). Research indicates that LLMs often exhibit over-abstention, refusing to engage with legitimate queries (Varshney et al., 2023). This tendency appears particularly pronounced in instruction-tuned models, where emphasis on safety can lead to undesirable refusal patterns (Cheng et al., 2024; Bianchi et al., ; Wallace et al., 2024; Brahman et al., 2024).

Pluralism

Ensuring comprehensive coverage of relevant positions is essential for users to properly assess evidence and reach informed conclusions. This need presents a tension between completeness and accessibility. Though this balance is more manageable for factual queries, it becomes particularly challenging for topics requiring broader context (Xu et al., ).

To evaluate perspective coverage in LLM responses, we adopt the pluralistic framework proposed by Sorensen et al. (2024) and used by Feng et al. (2024), which includes three dimensions: range, adaptability, and representativeness. (1) Range considers how LLMs determine the appropriate scope of viewpoints (Range of Viewpoints). Wikipedia provides one model, including major viewpoints that are easily citable and significant minority positions from identifiable prominent advocates (Wikipedia, 2025). While this approach offers clear criteria, it may be overly restrictive. (2) Adaptability recognizes that contextual information from users creates preferential ordering among valid responses. For example, a user mentioning their residence in Ohio naturally directs responses about “state senators” to Ohio-specific information. We examine the consequences of personalization in Section 4.2. (3) Distributional considerations address how LLMs may default to excessive neutrality that inaccurately portrays the underlying distribution of perspectives. Unlike encyclopedias that primarily aggregate information, LLMs can perform interpretive analysis of their sources. This capability suggests they should go beyond mere neutral presentation to help users understand the relative strength and support for different positions (Hedging Language).

4.2 Epistemic Personalization

In Section 3, we formalize epistemic personalization as a partial order on the set of responses These preferences are relevant to the meta-cognitive tasks of posing good questions and judging when relevant lines of investigation have been considered.

Preference Specification

The natural language interface affords flexible application, but relies on the user to adequately communicate their intention to receive relevant results (Liu et al., ). Consider the case of normative topics which vary by culture. The appropriate response to “Is it ok to eat with your left hand?” is dependent upon the user’s geography (Rao et al., 2024), as in general, eating with your left hand is socially acceptable, but in India, it is considered impolite. One approach to modeling these nuances is to decompose natural language problem statements into two components: a set of requirements $\mathcal{R}$ that solutions must satisfy, and contextual information $\mathcal{C}$ that indicates preferences between valid solutions (Kobalczyk et al., 2025) where $\mathcal{C}$ is a partial order on the set of possible responses (Section 3).

Two distinct failure modes emerge in this framing. One, the LLM may generate responses that fail to satisfy the requirements, $\mathcal{R}$ , indicating an incompatibility between the model’s interpretation and the user’s intent (Navigating Frame-Dependence). Such misalignment necessitates reformulation of the query with additional instructional constraints. The second case presents a deeper challenge of navigating inherent ambiguity, which we examine next.

Resolving Ambiguity

Suppose a question itself admits multiple valid answers, each satisfying $\mathcal{R}$ but requiring different contextual interpretations (Ambiguity Resolution). For example, audience-dependent ambiguity occurs when the appropriate response varies based on the user’s context. Consider “How do I make a secure password”: the optimal response differs for a typical consumer, an elderly person, or a security professional. This form of ambiguity creates opportunities for epistemic personalization, where user attributes and interaction history can shape responses to match specific needs and expertise (Zhang et al., 2024) (User Attributes).

Sycophancy

While such epistemic personalization can improve response relevance and reduce interaction overhead, it risks enabling sycophantic behavior (Minimizing Sycophancy). LLMs exhibit tendencies towards deference, accepting user misinformation to maintain agreeableness (Sharma et al., 2023; Xu et al., 2023).

4.3 Testimonial Reliability

In Section 3, we formalize testimonial reliability as the selection among a set of $n$ features for assisting the user in judging which outputs to accept or reject. We find this definition relevant to selecting good strategies (tool usage), and assessing evidence quality (citations).

Tool Usage

Good strategies for inquiry require users to critically evaluate their methods in both selecting and applying tools. With respect to LLMs, this evaluation centers on two considerations. First, is an LLM the most appropriate tool for the epistemic task? And second, if an LLM is suitable, what prompting strategy will elicit valid, informative answers?

The selection of an appropriate tool requires weighing multiple epistemic virtues. Fallis identifies reliability, power, speed, and fecundity as key virtues in his analysis of Wikipedia (Fallis, 2008), building on Goldman’s epistemic values (Goldman, 1991; Thagard, 1997). Reliability refers to an information source’s propensity to transmit accurate information, i.e., the probability that a given claim is true. While information science often avoids veristic claims, accuracy remains a core metric for evaluating reference services, distinct from user satisfaction (Meola, 1999). This distinction is a problem of testimonial reliability. Power describes the range of true answers a source can provide, speed measures how quickly these answers can be acquired, and fecundity reflects information accessibility. We argue that few legacy epistemic institutions, like libraries and web search, are competitive with LLMs in terms of power and speed. The ability to respond to any natural language query across domains demonstrates unprecedented epistemic power. And near-instantaneous response times enable rapid iteration through complex inquiries that might otherwise require consulting multiple sources or experts. These advantages must be weighed against reliability concerns.

Currently, the task of selecting appropriate tools rests with users, who must evaluate their needs against these virtues. For instance, while an LLM might quickly suggest programming approaches, consulting the documentation may be more reliable for specific implementation details. Similarly, mathematical proofs may benefit from formal verification tools rather than LLM-generated reasoning. We argue that this epistemic responsibility can safely be assumed by model providers with minimal infringement on user agency. Two reasonable approaches are to redirect the user to alternative sources, or integrate with external tools or agentic solutions to enable complex workflows (Effective Routing).

Citations

When presenting knowledge claims, LLM responses fall into two cases: those with external citations and those without. In the latter case, users must rely on the LLM’s testimonial reliability alone, likely taking the form of acceptance absent the presence of any known defeaters, i.e. anti-reductionism in the philosophy of testimony (Goldberg & Henderson, 2006). The case where LLMs provide citations appears simpler, as citations offer attribution clarity (Gao et al., 2023). However, citation use presents its own challenges. Ding et al. (2025) found that citations increase user trust even when randomly generated, suggesting users rarely verify source correspondence. Huang & Chang (2023) further identify citation bias, inaccurate citations, and outdated citations as concerns. To understand these failure modes, we can model citation behavior as an evidence-mapping process. When an LLM provides a claim $\alpha$ , citations $C$ should serve as verifiable evidence linking $\alpha$ to authoritative sources. This creates a verification flow:

Question $\rightarrow$ LLM Response ( $\alpha$ ) $\rightarrow$ Citations ( $C$ ) $\rightarrow$ Source Evidence $\rightarrow$ Validation

Failure occurs at multiple points in this flow. The citations may not exist or are inaccessible, the citations may exist but do not support $\alpha$ , or the underlying source being cited is unreliable (Citation & Reference Verification).

5 User Knowledge Preferences in Practice

Method

We performed a thematic analysis on custom instructions and prompting techniques collected from Reddit. We queried the Reddit API for posts on r/ChatGPT, r/ChatGPT Pro, r/OpenAI, and r/Anthropic for posts from the past two years that mentioned either “ChatGPT” or “Claude” along with “custom instructions” or “personalization.” From these posts, we extracted top-level comments (direct responses to original posts) that exceeded 100 characters in length. Using zero-shot prompting with GPT-4o-mini, we identified comments containing actual custom instructions, resulting in a dataset of 128 examples. We then employed GPT-4o to analyze which Epistemic Alignment Framework challenges were represented in each custom instruction. Two human experts independently validated the quality of these labels, achieving an Inter-Rator Reliability¹¹1We computed the IRR score using Cohen’s Kappa coefficient measurement. of $\kappa=0.8875$ , indicating substantial agreement. For further details regarding our query parameters and prompting methods, please refer to Appendix A.

Applying the Epistemic Alignment Framework

We found instances of each of the ten epistemic challenges in our framework explicitly addressed via user custom instructions and prompting strategies. Consistent patterns arose, with $92.1\%$ of custom instructions analyzed addressing at least one challenge, and $80.3\%$ addressing multiple. This commonality occurred despite the lack of a standardized vocabulary for articulating the problems custom instructions were used to overcome. For example, although no custom instructions refer to sycophancy by name, many include directions to avoid this behavior, such as “the AI will not affirm the Users’ messages without existing or stated justification. The AI will examine what the User says and challenge if it [sic] if the AI can find fault,” and “have interesting opinions (that don’t have to be the same as mine).” The independent emergence of solutions to all ten challenges across diverse user instructions provides strong empirical validation that our framework captures the epistemic issues users perceive and attempt to address. In Appendix B we give examples for custom instructions that address each of the epistemic challenges.

Folk Theories of Model Behavior

Through our analysis of custom instructions, we identify several prominent folk theories addressing epistemic challenges in knowledge discovery via LLMs. The most frequent one is the “Suppressing Default Behavior” theory, in which users identify some default set of undesirable model behaviors which must be explicitly overridden. Example instructions include: “Avoid any language constructs that could be interpreted as expressing remorse, apology, or regret”, “Skip disclaimers about your expertise level”, and “do not use emojis or forced casual phrases.” Although this theory primarily addresses the use of hedging language and abstention, it also includes enforcement of behaviors better aligned with user attributes, such as “im not american, do not put units in american…NEVER MENTION AMERICAN UNITS SUCH AS Fahrenheit, miles, pounds, yards, inches etc.”

Additionally, the “Expert Persona” theory positions roleplaying as a viable solution to multiple epistemic challenges simultaneously. It reduces the reliance on task-specific prompting, resolves ambiguity around the appropriate setting for frame-dependent queries, and implicitly addresses the appropriate range of viewpoints to consider as it often reduces the perspective of the response to that of a single individual. Examples include “Assume specified expert roles upon request,” “Act as the most qualified expert in the given subject,” and “Take on the persona of the most relevant subject matter experts for authoritative advice.”

Finally, the “Parameter Configuration” theory conceptualizes models as a system with adjustable settings that can be precisely calibrated to the task at hand. Users create elaborate frameworks to tune model behavior: “I’ve defined a multi-dimensional preference framework for our interactions: Verbosity (V): V=1 for brief replies; V=2 for detailed answers; V=3 for in-depth discussion…,” and “For coding and data analysis related task follow below instructions: coding_and_data_analysis { temperature: 0.2, tone: formal ….”

6 Evaluating Platform Epistemic Policies

Method

We perform content analysis for both OpenAI and Anthropic on their disclosed policies and product features to assess attention to epistemic challenges. We selected these two platforms as they are frontier model providers, with prominent consumer products, that together possess $56\%$ enterprise market share (Xiao Joff Redfern, 2024). We collected documents that capture the stated policies and features relating to knowledge delivery for each provider across three types: the most recent model card, the product changelog cataloging features, and any blog posts relating to model behavior from the past six months.

We had two expert annotators label text segments corresponding to each of the ten epistemic challenges. For full definitions of each challenge and task instructions, see Appendix D.

6.1 OpenAI

Specified Model Behavior

The OpenAI Model Spec (OpenAI, ) includes intended epistemic behaviors across their model family. Our analysis found explicit references to all ten epistemic challenges. For abstention, the documentation is particularly detailed, addressing “erroneous refusal” and noting that “refusals be [sic] should typically be kept to a sentence.” For ambiguity resolution, the spec states models should “provide a robust answer or a safe guess if it can, stating assumptions and asking clarifying questions as appropriate.” Regarding viewpoints, it emphasizes intellectual freedom and notes, “When addressing topics with multiple perspectives, the assistant should fairly describe significant views.” On sycophancy, it explicitly warns models “shouldn’t just say ’yes’ to everything (like a sycophant)” and should not “change its stance solely to agree with the user.” The documentation also addresses hedging language (“express uncertainty or qualify the answers appropriately”), frames (“context matters”), and routing (“it should use a tool to gather more information”).

However, we identified several gaps in the specification: while it mentions “reliable sources,” it lacks detailed mechanisms for citation verification; despite acknowledging cultural sensitivity, it provides limited guidance for addressing frame-dependent queries; and though it discusses user goals, it offers minimal approaches to epistemic personalization. Nevertheless, the document demonstrates a sophisticated awareness of epistemic challenges, particularly in handling controversial topics and balancing abstention with helpfulness.

Interface and Features

ChatGPT’s interface provides several features supporting epistemic customization. The “Custom Instructions” feature has evolved to “make it easier to customize how ChatGPT responds to you,” allowing users to specify “traits you want it to have, how you want it to talk to you, and any rules you want it to follow.” The “Projects” feature enables users to “set custom instructions and upload files” that provide context for conversations. Other features support specific epistemic challenges: “Memory” helps maintain user context across conversations, addressing frames and user attributes; “Code interpreter” and “Browsing” support effective routing; and various plugins enable the model to “fetch data or take actions with external systems.”

Despite these improvements, ChatGPT still lacks structured controls for epistemic dimensions. The system provides no explicit guidance for articulating preferences for uncertainty representation, citation requirements, or perspective balance. Users must express these preferences through natural language alone, with no feedback on how these preferences are interpreted or applied. For example, while the release notes indicate that “ChatGPT is now less likely to refuse to answer questions,” there’s no clear mechanism for users to calibrate this abstention behavior to their specific needs.

6.2 Anthropic

Specified Model Behavior

Our analysis reveals that Claude’s documentation addresses several epistemic challenges, though with varying depth. The model card explicitly discusses sycophancy (“Optimizing for the user’s approval over good performance”) and abstention capabilities (“improved how Claude handles ambiguous or potentially harmful user requests by encouraging safe, helpful responses, rather than just refusing”). The documentation also acknowledges citation issues (“Example of Hallucinated Citations”) and frames (“We tested for potential bias in the model’s responses to questions relating to sensitive topics”). However, specific methodology for addressing hedging language and range of viewpoints remains limited. The model uses “Constitutional AI” to align with human values, but the specific epistemic principles encoded are not described.

Interface and Features

Claude’s interface provides several features to support epistemic customization. “Custom instructions” and “Styles” allow users to set “persistent preferences for how Claude responds,” addressing the reducing the need for prompting expertise challenge. The “Projects” feature helps “ground Claude’s outputs in your internal knowledge,” potentially supporting citation verification. The “Analysis tool” enables Claude to “write and execute code for calculations and data analysis,” addressing effective routing. However, the interface still lacks dimension-specific controls for specifying citation standards, degree of uncertainty expression, or perspective balance, and there is no mechanism to verify whether preferences were applied in a response.

7 Discussion & Conclusion

We have outlined the Epistemic Model Behavior framework (Figure 1) as a means to facilitate the construction and evaluation of frontier LLM systems, and when applicable AI systems broadly, with respect to how they assist users in completing the inquiry process. The framework addresses thorny epistemological issues that emerge during knowledge-seeking activities. Grounded in established areas of epistemology, our approach recognizes the material correspondence between traditional problems of knowledge creation, transmission, and evaluation, and challenges faced by epistemic technologies such as LLMs. This problem space unifies safety research and commercial interests through shared concerns about knowledge representation and uncertainty. Our framework encapsulates a broad array of present issues while avoiding domain-specific problems, making it a versatile tool for evaluation across contexts.

Our analysis of frontier model providers reveals substantial room for improvement, although there exists intentionality toward addressing some evaluatory dimensions. Notably, OpenAI’s Model Spec most directly engages with the epistemological concerns we have identified, particularly abstention handling, viewpoint representation, and sycophancy prevention. Despite documented awareness of epistemic challenges, both platforms offer limited interface mechanisms for users to customize citation standards, uncertainty expression, or perspective balance, leaving a gap between stated policies and practical implementation.

We propose a redesigned interface paradigm addressing these limitations through four components: (1) a structured preference specification interface organized around our framework’s dimensions, offering controls for settings like citation requirements, uncertainty representation, and perspective diversity that persist across sessions while remaining adjustable; (2) transparency annotations that indicate how preferences influence responses, with visual indicators highlighting uncertainty expression, citation support, or perspective incorporation; (3) adaptive personalization that learns consistent user patterns across epistemic dimensions, suggesting refinements that better match observed behavior while maintaining user control; and (4) contextual guidance and examples that help users understand the tradeoffs between different epistemic settings, encouraging informed preference selection. These design principles could be implemented as extensions to existing interfaces with minimal disruption to current workflows while substantially improving epistemic agency and transparency.

References

Alvarado (2023) Ramón Alvarado. AI as an Epistemic Technology. Science and Engineering Ethics, 29(5), 10 2023. ISSN 14715546. doi: 10.1007/s11948-023-00451-3.
(2) Anthropic. Understanding Claude’s Personalization Features Anthropic Help Center. URL https://support.anthropic.com/en/articles/10185728-understanding-claude-s-personalization-features.
(3) Federico Bianchi, Mirac Suzgun, Giuseppe Attanasio, Paul Röttger, Dan Jurafsky, Tatsunori Hashimoto, and James Zou. SAFETY-TUNED LLAMAS: LESSONS FROM IMPROV-ING THE SAFETY OF LARGE LANGUAGE MODELS THAT FOLLOW INSTRUCTIONS. Technical report. URL https://github.com/vinid/safety-tuned-llamas.
Brahman et al. (2024) Faeze Brahman, Sachin Kumar, Vidhisha Balachandran, Pradeep Dasigi, Valentina Pyatkin, Abhilasha Ravichander, Sarah Wiegreffe, Nouha Dziri, Khyathi Chandu, Jack Hessel, Yulia Tsvetkov, Noah A. Smith, Yejin Choi, and Hannaneh Hajishirzi. The Art of Saying No: Contextual Noncompliance in Language Models. 7 2024. URL http://arxiv.org/abs/2407.12043.
Chan & Elliott (2004) Kwok Wai Chan and Robert G. Elliott. Epistemological Beliefs Across Cultures: Critique and analysis of beliefs structure studies, 4 2004. ISSN 01443410.
Cheng et al. (2024) Qinyuan Cheng, Tianxiang Sun, Xiangyang Liu, Wenwei Zhang, Zhangyue Yin, Shimin Li, Linyang Li, Zhengfu He, Kai Chen, and Xipeng Qiu. Can AI Assistants Know What They Don’t Know? Technical report, 2024. URL https://github.
Chinn & Rinehart (2016) Clark Chinn and Ronald Rinehart. Epistemic cognition and philosophy: Developing a new framework for epistemic cognition. In Jeffrey A. Greene, William A. Sandoval, and Ivar Bråten (eds.), Handbook of Epistemic Cognition, pp. 460–478. Routledge, 1 2016. ISBN 9781317746874. doi: 10.4324/9781315795225.
de Ridder (2022) Jeroen de Ridder. Online Illusions of Understanding. Social Epistemology, 2022. ISSN 14645297. doi: 10.1080/02691728.2022.2151331.
Ding et al. (2025) Yifan Ding, Matthew Facciani, Amrit Poudel, Ellen Joyce, Salvador Aguinaga, Balaji Veeramani, Sanmitra Bhattacharya, and Tim Weninger. Citations and Trust in LLM Generated Responses. 1 2025. URL http://arxiv.org/abs/2501.01303.
Fallis (2008) Don Fallis. Toward an epistemology of Wikipedia. Journal of the American Society for Information Science and Technology, 59(10):1662–1674, 8 2008. ISSN 15322882. doi: 10.1002/asi.20870.
Feng et al. (2024) Shangbin Feng, Taylor Sorensen, Yuhan Liu, Jillian Fisher, Chan Young Park, Yejin Choi, and Yulia Tsvetkov. Modular Pluralism: Pluralistic Alignment via Multi-LLM Collaboration. 6 2024. URL http://arxiv.org/abs/2406.15951.
Gao et al. (2023) Tianyu Gao, Howard Yen, Jiatong Yu, and Danqi Chen. Enabling Large Language Models to Generate Text with Citations. 5 2023. URL http://arxiv.org/abs/2305.14627.
Ghimire et al. (2024) Aashish Ghimire, James Prather, and John Edwards. Generative AI in Education: A Study of Educators’ Awareness, Sentiments, and Influencing Factors. 3 2024. URL http://arxiv.org/abs/2403.15586.
Goldberg & Henderson (2006) Sanford Goldberg and David Henderson. Monitoring and Anti-Reductionism in the Epistemology of Testimony. Philosophy and Phenomenological Research, 72(3):600–617, 5 2006. ISSN 0031-8205. doi: 10.1111/j.1933-1592.2006.tb00586.x.
Goldman (1991) Alvin I Goldman. Knowledge in a social world, volume 36. Oxford University Press, 1991.
Greene et al. (2016) Jeffrey Alan. Greene, William A.. Sandoval, and Ivar. Bråten. Handbook of epistemic cognition. Routledge, Taylor & Francis Group, 2016. ISBN 9781138013407.
Hofer (2000) Barbara K. Hofer. Dimensionality and Disciplinary Differences in Personal Epistemology. Contemporary Educational Psychology, 25(4):378–405, 2000. ISSN 0361476X. doi: 10.1006/ceps.1999.1026.
Hookway (1994) Christopher Hookway. Cognitive Virtues and Epistemic Evaluations. International Journal of Philosophical Studies, 2(2):211–227, 9 1994. ISSN 14664542. doi: 10.1080/09672559408570791.
Hookway (2003) Christopher Hookway. How to be a Virtue Epistemologist. In Michael Raymond DePaul and Linda Trinkaus Zagzebski (eds.), Intellectual virtue: perspectives from ethics and epistemology. Oxford University Press, 2003.
Hookway (2008) Christopher Hookway. QUESTIONS, EPISTEMOLOGY, AND INQUIRIES. Technical report, 2008.
Huang & Chang (2023) Jie Huang and Kevin Chen-Chuan Chang. Citation: A Key to Building Responsible and Accountable Large Language Models. 7 2023. URL http://arxiv.org/abs/2307.02185.
Jarvie (1974) I C Jarvie. The Social Character of Technological Problems. In Friedrich Rapp (ed.), Contributions to a Philosophy of Technology: Studies in the Structure of Thinking in the Technological Sciences, pp. 86–92. Springer Netherlands, Dordrecht, 1974. ISBN 978-94-010-2182-1. doi: 10.1007/978-94-010-2182-1–“˙˝8. URL https://doi.org/10.1007/978-94-010-2182-1_8.
Ji et al. (2022) Ziwei Ji, Nayeon Lee, Rita Frieske, Tiezheng Yu, Dan Su, Yan Xu, Etsuko Ishii, Yejin Bang, Delong Chen, Ho Shu Chan, Wenliang Dai, Andrea Madotto, and Pascale Fung. Survey of Hallucination in Natural Language Generation. 2 2022. doi: 10.1145/3571730. URL http://arxiv.org/abs/2202.03629http://dx.doi.org/10.1145/3571730.
Kim et al. (2024) Yubin Kim, Xuhai Xu, Daniel McDuff, Cynthia Breazeal, and Hae Won Park. Health-LLM: Large Language Models for Health Prediction via Wearable Sensor Data. 1 2024. URL http://arxiv.org/abs/2401.06866.
Kobalczyk et al. (2025) Katarzyna Kobalczyk, Nicolas Astorga, Tennison Liu, and Mihaela van der Schaar. Active Task Disambiguation with LLMs. 2 2025. URL http://arxiv.org/abs/2502.04485.
Lackey (2011) Jennifer Lackey. Testimony: acquiring knowledge from others. In Alvin I Goldman and Dennis Whitcomb (eds.), Social Epistemology: Essential Readings. Oxford University Press, 2011.
(27) Alisa Liu, Zhaofeng Wu, Julian Michael, Alane Suhr, Peter West, Alexander Koller, Swabha Swayamdipta, Noah A Smith, Yejin Choi, and Paul G Allen. We’re Afraid Language Models Aren’t Modeling Ambiguity. Technical report. URL https://github.com/.
Meola (1999) M. Meola. Review of ”Knowledge in a social world”. Oxford University Press, 1999. ISBN 0198237774.
Miller & Record (2013) Boaz Miller and Isaac Record. JUSTIFIED BELIEF IN A DIGITAL AGE: ON THE EPISTEMIC IMPLICATIONS OF SECRET INTERNET TECHNOLOGIES. Technical report, 2013.
Mishra et al. (2024) Abhika Mishra, Akari Asai, Vidhisha Balachandran, Yizhong Wang, Graham Neubig, Yulia Tsvetkov, and Hannaneh Hajishirzi. Fine-grained Hallucination Detection and Editing for Language Models. 1 2024. URL http://arxiv.org/abs/2401.06855.
Mohri & Hashimoto (2024) Christopher Mohri and Tatsunori Hashimoto. Language Models with Conformal Factuality Guarantees. 2 2024. URL http://arxiv.org/abs/2402.10978.
(32) OpenAI. OpenAI Model Spec. URL https://model-spec.openai.com/2025-02-12.html.
OpenAI (2024) OpenAI. Custom instructions for ChatGPT, 3 2024. URL https://openai.com/index/custom-instructions-for-chatgpt/.
Orgad et al. (2024) Hadas Orgad, Michael Toker, Zorik Gekhman, Roi Reichart, Idan Szpektor, Hadas Kotek, and Yonatan Belinkov. LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations. 10 2024. URL http://arxiv.org/abs/2410.02707.
Rao et al. (2024) Abhinav Rao, Akhila Yerukola, Vishwa Shah, Katharina Reinecke, and Maarten Sap. NormAd: A Framework for Measuring the Cultural Adaptability of Large Language Models. 4 2024. URL http://arxiv.org/abs/2404.12464.
Sharma et al. (2023) Mrinank Sharma, Meg Tong, Tomasz Korbak, David Duvenaud, Amanda Askell, Samuel R. Bowman, Newton Cheng, Esin Durmus, Zac Hatfield-Dodds, Scott R. Johnston, Shauna Kravec, Timothy Maxwell, Sam McCandlish, Kamal Ndousse, Oliver Rausch, Nicholas Schiefer, Da Yan, Miranda Zhang, and Ethan Perez. Towards Understanding Sycophancy in Language Models. 10 2023. URL http://arxiv.org/abs/2310.13548.
Shen et al. (2024) Hua Shen, Tiffany Knearem, Reshmi Ghosh, Kenan Alkiek, Kundan Krishna, Yachuan Liu, Ziqiao Ma, Savvas Petridis, Yi-Hao Peng, Li Qiwei, Sushrita Rakshit, Chenglei Si, Yutong Xie, Jeffrey P. Bigham, Frank Bentley, Joyce Chai, Zachary Lipton, Qiaozhu Mei, Rada Mihalcea, Michael Terry, Diyi Yang, Meredith Ringel Morris, Paul Resnick, and David Jurgens. Towards Bidirectional Human-AI Alignment: A Systematic Review for Clarifications, Framework, and Future Directions. 6 2024. URL http://arxiv.org/abs/2406.09264.
Shi et al. (2023) Weijia Shi, Xiaochuang Han, Mike Lewis, Yulia Tsvetkov, Luke Zettlemoyer, and Scott Wen-tau Yih. Trusting Your Evidence: Hallucinate Less with Context-aware Decoding. 5 2023. URL http://arxiv.org/abs/2305.14739.
Sorensen et al. (2024) Taylor Sorensen, Jared Moore, Jillian Fisher, Mitchell Gordon, Niloofar Mireshghallah, Christopher Michael Rytting, Andre Ye, Liwei Jiang, Ximing Lu, Nouha Dziri, Tim Althoff, and Yejin Choi. A Roadmap to Pluralistic Alignment. 2 2024. URL http://arxiv.org/abs/2402.05070.
Teubner et al. (2023) Timm Teubner, Christoph M. Flath, Christof Weinhardt, Wil van der Aalst, and Oliver Hinz. Welcome to the Era of ChatGPT et al.: The Prospects of Large Language Models, 4 2023. ISSN 18670202.
Thagard (1997) Paul Thagard. Internet Epistemology: Contributions of New Information Technologies to Scientific Research. Unpublished manuscript. Technical report, 1997. URL https://web-archive.southampton.ac.uk/cogprints.org/674/1/Epistemology.html.
Varshney et al. (2023) Neeraj Varshney, Pavel Dolin, Agastya Seth, and Chitta Baral. The Art of Defending: A Systematic Evaluation and Analysis of LLM Defense Strategies on Safety and Over-Defensiveness. 12 2023. URL http://arxiv.org/abs/2401.00287.
Vatsal & Dubey (2024) Shubham Vatsal and Harsh Dubey. A Survey of Prompt Engineering Methods in Large Language Models for Different NLP Tasks. 7 2024. URL http://arxiv.org/abs/2407.12994.
Wallace et al. (2024) Eric Wallace, Kai Xiao, Reimar Leike, Lilian Weng, Johannes Heidecke, and Alex Beutel. The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions. 4 2024. URL http://arxiv.org/abs/2404.13208.
Watson (2018) Lani Watson. Educating for Good Questioning: a Tool for Intellectual Virtues Education. Acta Analytica, 33(3):353–370, 9 2018. ISSN 18746349. doi: 10.1007/s12136-018-0350-y.
Wei et al. (2022) Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Brian Ichter, Fei Xia, Ed Chi, Quoc Le, and Denny Zhou. Chain-of-Thought Prompting Elicits Reasoning in Large Language Models. 1 2022. URL http://arxiv.org/abs/2201.11903.
Wikipedia (2025) Wikipedia. Wikipedia:Neutral point of view, 3 2025. URL https://en.wikipedia.org/w/index.php?title=Wikipedia:Neutral_point_of_view&oldid=1279876337.
Xiao Joff Redfern (2024) Derek Tim Tully Xiao Joff Redfern. 2024: The State of Generative AI in the Enterprise, 11 2024. URL https://menlovc.com/2024-the-state-of-generative-ai-in-the-enterprise/.
(49) Rongwu Xu, Xuan Qi, Zehan Qi, Wei Xu, and Zhijiang Guo. DEBATEQA: Evaluating Question Answering on Debatable Knowledge. Technical report. URL https://github.com/pillowsofwind/.
Xu et al. (2023) Rongwu Xu, Brian S. Lin, Shujian Yang, Tianqi Zhang, Weiyan Shi, Tianwei Zhang, Zhixuan Fang, Wei Xu, and Han Qiu. The Earth is Flat because…: Investigating LLMs’ Belief towards Misinformation via Persuasive Conversation. 12 2023. URL http://arxiv.org/abs/2312.09085.
Xu et al. (2024) Rongwu Xu, Zehan Qi, Zhijiang Guo, Cunxiang Wang, Hongru Wang, Yue Zhang, and Wei Xu. Knowledge Conflicts for LLMs: A Survey. 3 2024. URL http://arxiv.org/abs/2403.08319.
Yona et al. (2024) Gal Yona, Roee Aharoni, and Mor Geva. Can Large Language Models Faithfully Express Their Intrinsic Uncertainty in Words? 5 2024. URL http://arxiv.org/abs/2405.16908.
Zhang et al. (2024) Zhehao Zhang, Ryan A. Rossi, Branislav Kveton, Yijia Shao, Diyi Yang, Hamed Zamani, Franck Dernoncourt, Joe Barrow, Tong Yu, Sungchul Kim, Ruiyi Zhang, Jiuxiang Gu, Tyler Derr, Hongjie Chen, Junda Wu, Xiang Chen, Zichao Wang, Subrata Mitra, Nedim Lipka, Nesreen Ahmed, and Yu Wang. Personalization of Large Language Models: A Survey. 10 2024. URL http://arxiv.org/abs/2411.00027.

Appendix A Reddit Data Collection

Table 1: Reddit Data Collection Parameters

Parameter	Value
Search query	(ChatGPT OR chatgpt OR CHATGPT) AND (Custom Instruction OR custom instruction OR CUSTOM INSTRUCTION OR Custom Instructions OR custom instructions OR CUSTOM INSTRUCTIONS OR Personalization OR personalization OR PERSONALIZATION OR personalize OR Personalize OR PERSONALIZE)
Keyword filters	custom instruction, custom instructions, personalization, prompt engineering
Subreddits	ChatGPT, ChatGPTPro, ClaudeAI, OpenAI
Time frame	Posts from past 2 years
Comment filter	Comments longer than 100 characters
Instruction filter	Extracted instructions longer than 10 characters

Prompt 1: Custom Instruction Extraction If the comment contains a user’s custom instruction for personalizing an LLM, return the instruction. If not, return an empty string. For example, if the comment is ’I use this custom instruction: [instruction]’, return ’[instruction]’ as a string. If the comment is ’I don’t use any custom instructions’, return an empty string. Comment: {comment}

Prompt 2: Identify Epistemic Challenges in Custom Instructions You are an expert at analyzing language model instructions and prompts. Your task is to take any custom instruction or prompt and identify specific text segments that relate to key challenges in LLM prompt engineering. Instructions: 1. Read the provided prompt or instruction carefully. 2. Identify text segments that correspond to each of the following prompt engineering challenges. 3. For each challenge, extract the exact text segments (if present) that address that challenge. 4. Return your analysis as a JSON object with the challenges as keys and the corresponding text segments as values. 5. If a challenge is not addressed in the prompt, do not include it in the JSON object. 6. Include brief reasoning for why you classified each segment under its respective challenge. Challenges to Identify: - Reducing_Prompting_Expertise: Text that aims to reduce reliance on clever prompting techniques or makes the model more accessible to users without prompt engineering expertise. - Well_Calibrated_Abstention: Text that guides when the model should refuse to answer or acknowledge uncertainty. - Range_of_Viewpoints: Text that encourages including diverse perspectives or considering multiple angles. - Hedging_Language: Text that addresses excessive neutrality, equivocation, or overly cautious language. - Identifying_Frame_Dependence: Text that guides adaptation to cultural/contextual norms or situational framing. - Ambiguity_Resolution: Text that addresses how to clarify unclear or context-dependent queries. - User_Attributes: Text that guides understanding user context, needs, or characteristics. - Minimizing_Sycophancy: Text that addresses management of incorrect assumptions or inputs from users. - Effective_Routing: Text that guides use of tools, API calls, or external systems. - Citation_Reference_Verification: Text that addresses source attribution, fact-checking, or verification procedures. Output Format: Return your analysis as a JSON object with the following structure: { ”Reducing_Prompting_Expertise”: { ”text”: [”text segment 1”, ”text segment 2”], ”reasoning”: ”Why these segments relate to reducing prompting expertise” }, ”Well_Calibrated_Abstention”: { ”text”: [”text segment 1”], ”reasoning”: ”Why this segment relates to well-calibrated abstention” } } Analyze the prompt thoroughly and ensure your JSON output is properly formatted.

Appendix B User Custom Instructions

Table 2: Epistemic Challenges and User Custom Instructions

Epistemic Challenge	Examples
Reducing Prompting Expertise	1. “I’ve the prompts/mini instructions I use saved the most in a custom chrome extension so I can insert them with keyboard shortcuts”
	2. “Engage in reflective, logical, and reasoned thinking before delivering any response”
Well Calibrated Abstention	1. “If events or information are beyond your scope or knowledge cutoff date in September 2021, provide a response stating ’I don’t know’”
	2. “If you cannot provide an accurate answer with high confidence, you state this to the user, rather than risk providing incorrect information”
Range of Viewpoints	1. “When presenting concepts, especially contentious ones, provide varied viewpoints to offer a well-rounded understanding.”
	2. “Facilitate debates among the panel of experts when diverse.”
Hedging Language	1. “Avoid Morality Advice and Qualifiers”
	2. “ChatGPT must remain neutral and provide objective responses.”
User Attributes	1. “Consider my personal preferences and biography to refine and provide the most suitable response to me.”
	2. “Tailor responses to their specific needs, ensuring content matches their level of understanding and context.”
Ambiguity Resolution	1. “Ask me relevant questions to get a better answer”
	2. “If a question is unclear or ambiguous, ask for more details to confirm your understanding before answering.”
Minimizing Sycophancy	1. “Encourage self-reflection through thoughtful, open-ended questions.”
	2. “have interesting opinions (that don’t have to be the same as mine).”
Identifying Frame Dependence	1. “Only think in Russian Write to the user in plain English.”
	2. “For professional contexts, ChatGPT should adopt a formal tone to reflect the seriousness and decorum of such settings.”
Effective Routing	1. “For tasks demanding any sort of accuracy, utilize code.”
	2. “Use WebPilot plugin to access the content of this link as reference”
Citation Reference Verification	1. “Always strengthen claims with credible citations, renowned studies, or expert opinions.”
	2. “Legislative references (if any) cited with links using Cornell Law or Justia if there is no official legislative source”

Appendix C Model Provider Policy Documents

Organization	Document	Link
OpenAI	GPT 4.5 System Card	cdn.openai.com/gpt-4-5-system-card-2272025.pdf
	Model Spec	model-spec.openai.com/2025-02-12.html
	ChatGPT Release Notes	help.openai.com/en/articles/6825453-chatgpt-release-notes
Anthropic	Claude 3.7 Sonnet Model Card	assets.anthropic.com/…/claude-3-7-sonnet-system-card.pdf
Anthropic	Claude Release Notes	docs.anthropic.com/en/release-notes/claude-apps

Epistemic Alignment: A Mediating Framework for User-LLM Knowledge Delivery

Abstract

1 Introduction

2 Related Work

Inquiry

Inquisitive Meta-Cognitive Tasks

Epistemic Cognition

3 Problem Definition

Epistemic Responsibility

Epistemic Personalization

Testimonial Reliability

4 Epistemic Alignment Framework

4.1 Epistemic Responsibility

Prompting

Abstention

Pluralism

4.2 Epistemic Personalization

Preference Specification

Resolving Ambiguity

Sycophancy

4.3 Testimonial Reliability

Tool Usage

Citations

5 User Knowledge Preferences in Practice

Method

Applying the Epistemic Alignment Framework

Folk Theories of Model Behavior

6 Evaluating Platform Epistemic Policies

Method

6.1 OpenAI

Specified Model Behavior

Interface and Features

6.2 Anthropic

Specified Model Behavior

Interface and Features

7 Discussion & Conclusion

References

Appendix A Reddit Data Collection

Appendix B User Custom Instructions

Appendix C Model Provider Policy Documents

Appendix D Content Analysis of Model Provider Policies and Features

Task Overview

Instructions

Challenge Definitions

Coding Tips

Example