Publishing Confidential

Heidi McCahan

Jul 24

Hi there! A recent episode of the Novel Marketing podcast talks about optimizing author websites for discoverability in the age of AI. Perhaps listening or reading the transcript will help answer your questions?

Expand full comment

Steph Rae Moran

Jul 24

Thanks for mentioning this! I listen to the Novel Marketing podcast regularly, too. Just this past weekend I saw that he had a few episodes on AI optimization, so I listened to them. I think the two most helpful are the one that covers AI optimization in general for authors and the one that focuses on optimizing book pages.

Expand full comment

Becca Behnke

Jaclyn Paul, aka Lena George

Gosh, I am a bit scared thinking of all of the AI inserting itself into our lives. Makes me want to stick to my notebook and pen and hide all my good stuff away in my book desk which looks like an old fashioned pulpit.

Expand full comment

Okay, I've been wondering about this! A reader emailed me a few months ago and told me she discovered me through ChatGPT. That was a real new one for me. Apparently she was using it to get resources for adult ADHD and my book was the first one it recommended! I had no idea why or how, though, or if this was something authors had any influence on. I believe this book was *not* in the pirated dataset these LLMs were trained on, so it's not that -- it's something more organic. Like word of mouth, but AI. Really interesting.

Expand full comment

Kathleen Schmidt

Think of it as Google. That is what it is turning into.

Expand full comment

Jack Lively

More like Wikipedia if you’re using ChatGPT.

Here’s a rundown of the top three user-facing AI apps—ChatGPT, Claude, and Gemini—with a focus on where they likely source their information (in order of preference or reliability, based on known practices):

⸻

🧠 1. ChatGPT (OpenAI)

Models: GPT-4, GPT-4o

Interface: chat.openai.com, API, desktop & mobile apps

Primary Information Sources:

• ✅ Licensed datasets (e.g. books, Wikipedia, technical documentation)

• ✅ Public web data (up to training cutoff, e.g. Common Crawl, public forums)

• ✅ Third-party partnerships (e.g. integration with Microsoft Bing for real-time search)

• ⚠️ No access to proprietary subscription content unless explicitly licensed

Notes:

• GPT-4o has multimodal capabilities (text, image, voice, code, video input).

• ChatGPT integrates with tools (e.g. Python, web browsing) in pro versions.

⸻

🤖 2. Claude (Anthropic)

Models: Claude 2, Claude 3 family

Interface: claude.ai, Slack plugin, API

Primary Information Sources:

• ✅ Public domain content (e.g. Wikipedia, Project Gutenberg)

• ✅ Publicly available web data (carefully filtered for safety and fairness)

• ✅ Anthropic’s own curated datasets (aligned with constitutional AI values)

• ❌ No direct web access or external tool integration

Notes:

• Claude is known for its strong alignment and polite tone.

• Claude 3 Opus matches or outperforms GPT-4 in some reasoning and language tasks.

⸻

🌐 3. Gemini (Google DeepMind)

Models: Gemini 1.5 family

Interface: gemini.google.com, Google Workspace (Docs, Gmail, etc.), API

Primary Information Sources:

• ✅ Google Search index & Knowledge Graph

• ✅ YouTube transcripts, Google Books, Google Scholar (selective data use)

• ✅ Public web data (curated and filtered)

• ✅ Internal Google content (selectively, depending on use case and privacy settings)

Notes:

• Deep integration with Google products gives Gemini real-time info and app-level knowledge.

• Gemini can retrieve and reason over live data via search better than Claude or base ChatGPT.

⸻

⚖️ Summary Comparison – Data Sources Preference Order

Rank ChatGPT Claude Gemini

1️⃣ Licensed datasets Public domain Google Search & internal products

2️⃣ Public web data Curated web data Public web data

3️⃣ Bing search (pro version) In-house curation YouTube, Scholar, etc.

Expand full comment

Joyce Reynolds-Ward

Hmm. I hadn’t thought I was seeing much response from my Instagram postings. I’ll have to reconsider that.

Expand full comment

Christa M. Hines

Just signed up for the class. Looking forward to learning from you!

Expand full comment

Megan Woodgrave

I appreciate the info about google search, but was this written partly by AI? I ask in part because I find many of the AI enthusiasts here on Substack (logically) have no qualms about using it to generate newsletter content. Also, I'm confused about some of the writing - what is "how to start a conversation about this"? Who are you talking to?

I prefer not to pay for AI generated content, as I disagree with the ethics behind it, so I'd appreciate the disclosure either way. Thanks!

Expand full comment

Kathleen Schmidt

If you have read this newsletter for any amount of time, you'd know I do not use AI to write it. I am a bit insulted that you're asking me that.

What do you think "How to start a conversation" means? It's literally how you'd begin a conversation talking about all of this.

Please don't pay for this newsletter. Learning about AI and being an AI enthusiast are two very different things.

Expand full comment

Megan Woodgrave

Jul 17

Thanks Kathleen, I appreciate the answer, though I'm puzzled that the question offended you. You've mentioned in the past that you subscribe to multiple AI services and use it regularly for work...I assume this newsletter is work? You also say you use Claude for writing. Given this, why would it surprise you that some people wonder if you use it to generate newsletter content, or would want clarification on that?

Expand full comment

Kathleen Donahoe

Thanks Kathleen- this is so important. I feel entirely daunted by the captioning, but always glad to know what new nonsense I need to learn. Appreciate your very basic explanation, we need it.

Expand full comment

Cassandra Powers

A terrific summary for those of who prefer not wade through the weeds, thank you! Wondering if you have a quick example of 'proactive' publicity, v. 'reactive' publicity?

Expand full comment

Kathleen Schmidt