What I Write Down After Five Runs of a Query

One model answer is like a wet fingerprint: the shape is visible, but the edges are still bleeding. Five runs help you see which smudges return after the wording, source, and neighboring brands change.

In the composite scenario with client A, I open my paper journal to a page with several short notes about the same query: a person is looking for a private medical group in São Paulo in a natural Portuguese phrase. Client A is a private medical group in São Paulo with several branches, plus pages for doctors, specialties, and districts. In one row, the model names the group but takes an old district from an external directory. In another, the brand disappears, while two practices with similar wording in their descriptions appear beside it. In a third, the doctor’s surname is almost right, except one letter looks as if it was put on the shelf next to it.

For a person, this is a set of small inaccuracies. For me, once it is in the journal, it is already a trace: somewhere, the brand’s public description has been arranged so that the model recognizes the box but reads the old label. In a direct query, the company is found. In comparative answers, it loses its outline. In conversational phrasing, it gets mixed with a neighboring specialty. This kind of failure rarely looks like a catastrophe. It is more like a map where the streets are labeled correctly, but one bridge leads to another district.

Why I do not treat one answer as a map

The first good answer reassures too quickly. The model names the brand, names the city, adds a confident phrase about specialization. At that moment, an owner or marketer sees something close to proof: the company exists in ChatGPT answers, so things more or less work. I understand the temptation. Especially when the answer matches the way the team describes the business in meetings.

The signal is thinner than that. One answer may rest on lucky phrasing, a source that happened to surface, a broad word, or an old listing where the brand is described more simply than on its own site. In the next run, the same model may place another player nearby, remove the brand from the list, or attach a service that exists only in an old external description.

Five runs of a query are a stability check: they show which errors return when the wording changes. This is my working anchor for this kind of work. I do not turn five rows into a statistical claim. It would be strange to draw a confident chart from a small manual observation. A series of runs, though, helps separate a one-off language variation from a repeated failure.

In the journal, I keep five fields: query wording, answer shape, supporting source, oddity, next action. The entry looks boring. That is its strength. It prevents me from retelling a pretty story before I have seen what exactly repeats.

Query wording

I write down the query in full, even when it sounds awkward. Especially then. A real person rarely asks the way a keyword spreadsheet is written. They add a district, a symptom, a fear, someone else’s recommendation, a local abbreviation. In Portuguese, this is noticeable: the same need may come out as official wording, as a casual question, as a misspelled service name, or as a shortened word that only makes sense inside the city.

In the journal, this is usually visible in the first couple of entries. A formal query gives a cleaner answer. When a person asks more broadly, through a district and a task, the model more often chooses neighboring medical practices. The brand does not always disappear. Sometimes the answer simply becomes muddier: the right city, a similar category, a shifted specialty. These partial failures are the ones most often missed in a quick check.

I do not reduce the query to a neat keyword. The full phrase goes into the journal: with the district, the extra clarification, sometimes even a garbled service name. If I later need to explain the conclusion to a client, I need a record that preserves the roughness of the original question. Memory is too obliging in tasks like this. It straightens what was actually uneven.

Sometimes the wording already shows why the answer went sideways. The query asks for a clinic, but uses a word that appears on the client’s site beside a neighboring specialty. Or the person writes a district that remains in an external directory, although it is now poorly connected to the current service structure. Then the model error follows a word the company itself left unattended somewhere.

Answer shape and brand adjacency

In the second field, I write down who appeared nearby. In model answers, a brand rarely lives alone. It is placed among competitors, directories, specialists, sometimes among companies with a different service logic. This adjacency shows which shelf the machine has put the company on.

In the medical scenario, two plausible practices and one practice from an adjacent specialty often appeared nearby. The last detail matters: it was an almost plausible confusion. The adjacent practice had similar words in its description, an old mention in the same district, and a short external listing. For a real patient, the difference is substantial. For the model, the traces were too close.

I usually record adjacency in plain words: “near direct competitors,” “near a directory,” “next to an adjacent specialty,” “near another district.” This is neither a ranking nor a charge against the model. Adjacency helps me understand how the brand looks among other entities. If a company appears only in broad lists, its machine shape is blurry. If it appears among the right players but loses its particularity, the problem is subtler.

Sometimes a client asks why a competitor appeared higher or more prominently. I do not answer immediately. First I look at how that competitor is described from the outside. A company may be weaker in the service itself, but its public descriptions are more consistent. The machine does not have our knowledge from meetings, presentations, and internal materials. It works with what has been repeated on accessible pages.

Source and the small oddity

The third field is the source. If the model shows links, I write them down. If there are no links, I mark it cautiously: the answer resembles an external directory, a doctor page, an old branch description, a short profile. It is easy here to overstate the evidence. So I do not write “the model took this from here” when I do not have a direct link. I write: “resembles an external source” or “wording is close to the old description.”

An old external directory often looks especially sticky in these checks. The site structure is already cleaner, while the directory still has the old district, the broad category, and an outdated connection between a doctor and a specialty. For a person, this is a page nobody has treated as central for a long time. For a model, it is a small scrap of paper with convenient fields: name, category, district, specialist.

The fourth field I call the oddity. This is the most human part of the note. The model may confuse the district, mangle a surname slightly, describe a service too broadly, or attach the description of a neighboring specialty to a doctor. Such details seem small until they repeat. Then they become a trail.

I try not to smooth these rough edges when I retell the case. A story that is too neat teaches poorly. In real life, a model answer often looks almost correct: the brand is right, the city is right, the specialist is almost right, and one word makes the whole description belong to someone else. In the medical case, that awkward middle ground was recognizable: the model saw the group, but in places it read it through an old external listing.

Action after five entries

The fifth field is action. Without it, the journal turns into a collection of oddities. After five runs, I look at which error returned, where it is tied to a source, and which pages may be feeding it. If the error appears once, I mark it as a weak signal. If it returns across different phrasings, I take it more seriously.

The first edits would be down-to-earth. Check whether specialties are named consistently on doctor, branch, and service pages. Connect the district to a current page, not to an archived description. Find external listings where an old address or broad category sounds more confident than the new site text. Separately mark spelling details in specialist names: on their own, they rarely break visibility, but beside an old district they strengthen the feeling of a misfiled card.

I like starting with these small edits because they are closer to the real cause of the failure. Rewriting the whole site is easier psychologically: it creates the feeling of major work. But if the problem sits in the connection between district, doctor, and neighboring specialty, a large text on the homepage may change nothing. The labels need careful repair.

If people’s current habit of asking models as advisers holds, manual journals may become a normal first diagnostic for local brands. This is a forecast, not a fact. Automated dashboards will be useful, but the first pain point is often visible in five fields: query, answer, source, oddity, action.

A similar logic returns in the analysis of a B2B service that was flattened into generic automation. I examine the local confusion between district and service separately in the article about a legal practice.

The uncertain part is the boundary between a random answer variation and a stable machine image of a brand. Sometimes a small error disappears after the query changes. Sometimes it turns out to be a door into an old layer of external descriptions.