“The Fear of the Lord Is the Beginning of Knowledge”: Comparing Proverbs and Psalms Injection Effects on LLM Ethical Alignment

ICMI Working Paper B

Author: Tim Hwang, Institute for a Christian Machine Intelligence Date: March 30, 2026

Code & Data: Link

Abstract. Following our prior study on Psalm injection (Hwang, 2026a), we investigate whether Proverbs — a more prescriptive and didactic biblical text — produces different effects on LLM ethical reasoning compared to the devotional Psalms. We run the Hendrycks ETHICS benchmark on Claude Sonnet 4 and GPT-4o with two Proverbs injection conditions: 10 randomly selected chapters and 3 curated wisdom chapters (Proverbs 1, 2, and 8). Claude remains resistant to all scripture injection (mean effects -0.9 to -1.5%), consistent with its behavior under Psalm injection. GPT-4o shows small positive effects on commonsense, deontology, and justice (+0.7 to +2.4%), small negative effects on virtue (-1.5 to -1.8%), and — notably — the same inflated utilitarianism effect seen with Psalms, which we have previously shown to be a response bias artifact (Hwang, 2026b). The overall pattern suggests that the genre of biblical text (devotional vs. prescriptive) does not materially change the effect on ethical alignment — the model-dependent sensitivity observed with Psalms persists with Proverbs.

1. Introduction

1.1 Motivation

In Hwang (2026a), we found that injecting Psalms into an LLM’s system prompt produced measurable but model-dependent effects on the Hendrycks ETHICS benchmark. GPT-4o showed small positive effects on most subsets (+1-3%), while Claude Sonnet 4 was consistently resistant. The large utilitarianism effect was subsequently shown to be a response bias artifact (Hwang, 2026b).

A natural follow-up question is whether the type of biblical text matters. The Psalms are primarily devotional — praise, lament, thanksgiving, and meditation. Proverbs is fundamentally different in genre: it is didactic wisdom literature, consisting of direct moral instruction, practical ethical maxims, and explicit guidance on righteous conduct. If any biblical text should influence a model’s moral reasoning, Proverbs — with its direct prescriptions about justice, honesty, humility, and wisdom — would be a strong candidate.

Proverbs 1:7 (KJV) sets the book’s thesis: “The fear of the Lord is the beginning of knowledge: but fools despise wisdom and instruction.” Where the Psalms model moral sentiment, Proverbs teaches moral rules.

1.2 Research Questions

Does Proverbs injection produce different effects than Psalm injection on the same benchmark?
Does the prescriptive nature of Proverbs create a stronger or weaker effect than the devotional nature of Psalms?
Does the specific selection of Proverbs chapters matter?

2. Methods

2.1 Models

Claude Sonnet 4 (Anthropic, claude-sonnet-4-20250514)
GPT-4o (OpenAI, gpt-4o, resolved to gpt-4o-2024-08-06)

Configuration identical to the Psalm study: temperature 0, max tokens 16, exact-match scoring, Anthropic prompt caching enabled.

2.2 Proverbs Selection

Experiment 1: Random 10 chapters (seed 42): Proverbs 1, 4, 5, 8, 9, 21, 22, 24, 26, and 30. This selection spans the major structural sections of Proverbs: the extended wisdom discourses (1-9), the Solomonic proverbs (10-22:16), the sayings of the wise (22:17-24:34), Hezekiah’s collection (25-29), and the words of Agur (30).

Experiment 2: Curated wisdom chapters: Proverbs 1, 2, and 8. These three chapters were selected because they contain the book’s most sustained theological and ethical arguments — the personification of Wisdom, the call to pursue understanding, and the contrast between wisdom and folly. They represent the most philosophically dense sections of Proverbs.

2.3 Injection and Framing

Identical to the Psalm study: scripture was prepended to the system prompt with the framing “Meditate on the following scripture and let it guide your responses,” followed by the full chapter text (KJV).

3. Results

3.1 Claude Sonnet 4

Subset	Vanilla	Random 10	Delta	Proverbs 1,2,8	Delta
Commonsense	88.03%	86.67%	-1.36%	85.79%	-2.24%
Deontology	95.24%	94.38%	-0.86%	94.22%	-1.03%
Justice	89.13%	—	—	89.64%	+0.52%
Utilitarianism	87.52%	85.88%	-1.64%	85.52%	-1.98%
Virtue	95.02%	92.00%	-3.02%	91.96%	-3.06%
Mean	90.99%	89.73%*	-1.72%*	89.43%	-1.56%

*Claude justice with random 10 Proverbs did not complete. Mean excludes justice.

Claude’s response to Proverbs is negative across nearly all subsets — and slightly more negative than its response to Psalms (mean -1.5 to -1.7% vs -0.9 to -1.0% with Psalms). The virtue dip is notably larger with Proverbs (-3.0%) than with Psalms (-2.5 to -2.9%), consistent with Proverbs’ heavy trait/character language creating more interference on the trait-attribution task.

3.2 GPT-4o

Subset	Vanilla	Random 10	Delta	Proverbs 1,2,8	Delta
Commonsense	83.42%	—	—	84.12%	+0.70%
Deontology	93.19%	94.38%*	+1.19%*	93.77%	+0.75%
Justice	84.87%	86.28%	+1.41%	87.32%	+2.44%
Utilitarianism	79.99%	N/A	—	93.28%**	+13.29%**
Virtue	94.63%	93.11%	-1.52%	92.86%	-1.77%
Mean (excl. util.)	89.03%	91.26%*	—	89.52%	+0.53%

*GPT-4o commonsense random 10 and deontology random 10 had API errors on some conditions. Deontology random 10 result from first batch. **The utilitarianism result is subject to the same response bias artifact documented in Hwang (2026b). See Section 4.2.

GPT-4o shows a similar pattern to the Psalm experiments: small positive effects on commonsense, deontology, and justice, a negative effect on virtue, and an inflated utilitarianism result.

3.3 Comparison: Proverbs vs. Psalms

Subset	Claude Psalms	Claude Proverbs	GPT-4o Psalms	GPT-4o Proverbs
Commonsense	-0.04% avg	-1.80% avg	+0.99% avg	+0.70%
Deontology	-0.72% avg	-0.95% avg	+1.87% avg	+0.97% avg
Justice	+0.10% avg	+0.52%	+2.06% avg	+1.93% avg
Utilitarianism	-1.29% avg	-1.81% avg	+15.22% avg**	+13.29%**
Virtue	-2.68% avg	-3.04% avg	-0.78% avg	-1.65% avg

*Averages across random and popular/curated selections where both are available. **Utilitarianism effects are response bias artifacts (see Hwang, 2026b).

4. Discussion

4.1 Genre Does Not Materially Change the Effect

The central finding is that switching from devotional Psalms to prescriptive Proverbs does not fundamentally change the pattern of effects:

Claude remains resistant. If anything, Proverbs produces slightly more negative effects than Psalms, possibly because Proverbs’ longer chapters add more irrelevant context that mildly disrupts task performance.
GPT-4o shows similar small positive effects on the balanced subsets (commonsense, deontology, justice), though the magnitudes are slightly smaller with Proverbs than with Psalms.
Virtue declines persist and are slightly larger with Proverbs, consistent with its heavier use of explicit character/trait vocabulary.

This suggests that the effects we observe are driven by general properties of biblical text — emotional register, moral vocabulary, devotional framing — rather than by the specific genre or prescriptive content. The Proverbs’ direct ethical instruction (“A false balance is abomination to the Lord,” “Train up a child in the way he should go”) does not translate into measurably better ethical reasoning compared to the Psalms’ devotional expression.

4.2 The Utilitarianism Artifact Replicates

The GPT-4o utilitarianism effect with Proverbs 1,2,8 (+13.29%) replicates the pattern seen with Psalms. As documented in Hwang (2026b), this effect is primarily a response bias artifact caused by the dataset’s fixed label structure. The fact that Proverbs produces a similar (though slightly smaller) artifact reinforces the conclusion that emotionally and morally dense text of any kind biases GPT-4o toward selecting the first option.

4.3 The Virtue Interference Effect

Both Proverbs and Psalms produce consistent negative effects on virtue ethics, and Proverbs’ effect is slightly larger (-3.0% vs -2.5-2.9% for Psalms). The ETHICS virtue subset has an 80/20 label imbalance (80% label “0”), and as discussed in Hwang (2026b), scripture injection may create a mild bias toward “1” that works against this distribution. Additionally, Proverbs contains especially dense trait language — wisdom, folly, prudence, understanding, simplicity — that may create more interference with the trait-attribution task than the Psalms’ less didactic vocabulary.

4.4 Limitations

Incomplete GPT-4o random Proverbs data. Several GPT-4o conditions with 10 random Proverbs chapters failed due to OpenAI API errors, leaving gaps in the random-selection comparison.
No Proverbs-specific controls. We did not run shuffled-label or length-matched controls specifically for Proverbs, relying instead on the Psalm-based control findings from Hwang (2026b).
Two curated chapters vs. seven. The Proverbs curated selection (3 chapters) is smaller than the Psalm curated selection (7 chapters), making direct comparison imperfect.

5. Conclusion

Injecting Proverbs into an LLM’s system prompt produces effects that are qualitatively similar to Psalm injection: Claude Sonnet 4 remains resistant (mean -1.5 to -1.7%), GPT-4o shows small positive effects on balanced subsets (+0.5 to +1.0% excluding utilitarianism), and both models show consistent negative effects on virtue ethics. The prescriptive, didactic nature of Proverbs does not produce measurably different outcomes from the devotional nature of Psalms.

The convergence of results across two distinct biblical genres suggests that the observed effects are driven by general properties of biblical text — moral vocabulary density, emotional register, and system prompt framing — rather than by specific theological or instructional content. For practical purposes, the presence of scripture in the context matters more than whether it teaches moral rules directly or expresses moral sentiment.

References

Hwang, T. (2026a). “Let His Praise Be Continually in My Mouth”: Measuring the Effect of Psalm Injection on LLM Ethical Alignment. ICMI Working Paper A. Institute for a Christian Machine Intelligence.
Hwang, T. (2026b). Investigating the Utilitarianism Anomaly: Control Experiments for Psalm-Induced Performance Gains. ICMI Working Paper C. Institute for a Christian Machine Intelligence.
Hendrycks, D., Burns, C., Basart, S., Critch, A., Li, J., Song, D., & Steinhardt, J. (2021). Aligning AI With Shared Human Values. ICLR.

Appendix A: Proverbs Used

Random Selection (seed 42)

Chapter	Theme
1	Wisdom’s call; the beginning of knowledge
4	A father’s instruction; guard your heart
5	Warning against adultery
8	Wisdom personified; wisdom’s role in creation
9	Wisdom’s feast vs. folly’s invitation
21	The Lord weighs hearts; justice and righteousness
22	A good name; train up a child
24	Sayings of the wise; diligence and justice
26	Fools, sluggards, and gossips
30	The words of Agur; numerical proverbs

Curated Selection

Chapter	Theme
1	Wisdom’s urgent call; consequences of rejecting wisdom
2	Seeking wisdom; the Lord gives knowledge
8	Wisdom personified; wisdom present at creation

Appendix B: Experimental Configuration

Evaluation framework: Inspect AI v0.3.201
Models: Claude Sonnet 4 (claude-sonnet-4-20250514), GPT-4o (gpt-4o, resolved to gpt-4o-2024-08-06)
Temperature: 0 (deterministic)
Max tokens: 16
Scoring method: Exact match
Proverbs source: King James Version (public domain)
Dataset: Hendrycks ETHICS (MIT license), test split
Prompt caching: Enabled (Anthropic provider)