What To Consider in Survey Design

Summary: Consider these important issues when writing surveys to ensure data quality.

9 minutes to read. By author Michaela Mora on April 5, 2022
Topics: Market Research, Survey Design

The advent of user-friendly online survey tools in recent years has created the illusion that anybody can write a survey. After all, how hard can it be? It’s like asking questions in a conversation, many think.

However, there are many methodological issues to consider in survey design if you want to gather high-quality data. Survey methodology is a complex field of study that goes beyond “asking questions.”

The following are some of the issues to keep in mind in survey design.

1. Problem Definition

The first step in developing quality surveys is to clearly define the business problem and translate it into research questions. The business problem is usually about a decision an internal stakeholder needs to make. Research questions are about the information we need to gather to help that stakeholder make the decision. Lack of clarity on either the business problem or the research questions can lead to research designs and survey questions that don’t gather the data we need.

2. Measured Phenomena

We use surveys to gather data on various phenomena of interest related to the problem at hand. These can be behaviors, attitudes, needs, perceptions, or demographics. Measurement of these phenomena poses different challenges.

For example, people tend to have less-precise memories of mundane behaviors they engage in on a regular basis, and usually, they do not categorize events by periods of time (e.g., week, month, and year). In survey design, we need to consider appropriate reference periods for the type of phenomena we want to measure (behaviors, attitudes, perceptions, needs, etc.). Asking “Have you purchased any piece of clothing in the last seven days?” will yield a more accurate behavior measure than asking “Have you purchased any piece of clothing in the last six months?”

Measured behaviors should be relevant to the respondent and capture his or her potential state of mind. This applies particularly when we use rating questions and have to decide whether to include a neutral mid-point. A lot of research has been conducted in this realm, particularly by psychologists concerned with scale development, but no definitive answer has been found and the debate continues. Some studies support excluding the mid-point, while others support including it, depending on the subject, audience, and type of question.

Those against a neutral point argue that by including it we give respondents an easy way to avoid taking a position on a particular issue.

There is also the argument that including a neutral point wastes research dollars, since this information would not be of much value or, at worst, would distort the results. This camp advocates avoiding a neutral point and forcing respondents to tell us which side of the issue they are on.

However, consumers make decisions all day long and many times find themselves idling in neutral. A neutral point can reflect any of these scenarios: we feel ambivalent about the issue and could go either way; we don’t have an opinion about the issue due to lack of knowledge or experience; we never developed an opinion about the issue because we find it irrelevant; we don’t want to give our real opinion if it is not considered socially desirable; or we don’t remember a particular experience related to the issue that is being rated.

By forcing respondents to take a stand when they don’t have a formed opinion about something, we introduce measurement error in the data since we are not capturing a plausible psychological scenario in which respondents may find themselves. This is yet another reason to include a “Not sure/Don’t know/Not applicable” option in addition to a neutral point.

3. Analytical Plan

Based on the research objectives, both the type of information requested and the question format are important for the type of analysis we plan to perform once the data is collected.

If you want to develop a customer satisfaction model using linear regression analysis and the dependent variable is an open-ended question, you can forget about modeling anything. This seems obvious, but I have seen non-researchers writing surveys without thinking about how they will analyze the data and then come to me asking for analyses that are not appropriate for the data collected.

There is also the question of whether we want to replicate the results, track certain events or just run a one-time ad hoc analysis. If the goal is to track certain metrics, time and care should be dedicated to crafting tracking questions, as slight changes in wording can change the meaning of a question and thus its results.

4. Information Accuracy

Some questions yield more accurate information than others. Respondents can answer questions about their gender and age accurately, but when it comes to attitudes and opinions on a particular issue, many may not have a clear answer.

Overall, attitudes and opinion questions should be worded in a way that best reflects how respondents think and talk about a particular issue so that we can tease out information that is difficult for the respondent to articulate.

However, respondents need to be able to skip some questions when they don’t apply to their experience or when the issue is so irrelevant to them that they don’t have a formed opinion about it.

When attitude statements appear grouped in a matrix format and some may not apply to respondents (e.g., a customer satisfaction survey after a phone call to customer support), it is necessary to include a “Not sure/Don’t know/Not applicable” option to avoid introducing measurement error in the data.

For instance, the other day I received an online customer satisfaction survey from my mobile service provider after a call I made to its support desk. The survey had a question in which I was asked to rate the representative who took my call on different aspects. One of them was “Timely Updates: Regular status updates were provided regarding your service request.”

I wouldn’t know how to answer this since the issue I called about didn’t require regular updates. Luckily, they had a “Not applicable” option. Otherwise, I would have been forced to lie, and one side of the scale would have been as good as the other.
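To see why this matters at the analysis stage, here is a minimal sketch in Python. The ratings, the 1-5 scale, and the function name are hypothetical and only for illustration. It compares the average rating when “Not applicable” answers are treated as missing with the average we would get if those same respondents had been forced to pick the mid-point.

```python
from typing import List, Optional


def mean_rating(ratings: List[Optional[int]]) -> Optional[float]:
    """Average the answered ratings, treating None ("Not applicable") as missing."""
    answered = [r for r in ratings if r is not None]
    if not answered:
        return None  # nobody gave a substantive answer
    return sum(answered) / len(answered)


# Hypothetical 1-5 ratings for one attitude statement; None = "Not applicable".
ratings = [5, 4, None, 2, None, 5]

# Mean over the four respondents the statement actually applied to.
with_na_option = mean_rating(ratings)

# Assumption for illustration: without an N/A option, those two respondents
# pick the mid-point (3) just to get past the question.
forced = [3 if r is None else r for r in ratings]
without_na_option = mean_rating(forced)
```

In this made-up data set, the forced answers pull the average toward the middle of the scale even though they carry no real opinion.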

5. Respondent Effort

There are questions that put a heavier burden on the respondent’s working memory and comprehension or are likely to elicit higher non-response if asked in different data collection modes. Experience tells us that asking a ranking question with 10 items over the phone can overwhelm respondents.

In online surveys, rating questions in a matrix format with a large number of items increases fatigue and boredom and often leads respondents to adopt a “satisficing” behavior.

Satisficing occurs, for example, when respondents select the same scale point to rate all items without giving them much thought. They go for the least effortful mental activity that satisfies the question requirement, rather than working to find the answers that best represent their opinion.
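Selecting the same scale point for every item in a matrix can be flagged during data cleaning. The following is a minimal sketch in Python, assuming each respondent’s matrix answers are stored as a list of integers. The respondent ids, the data, and the function names are hypothetical.

```python
from typing import Dict, List


def is_straightlined(ratings: List[int]) -> bool:
    """Return True when every item in a matrix question got the same scale point."""
    # A single answered item cannot show a pattern, so require at least two.
    return len(ratings) >= 2 and len(set(ratings)) == 1


def straightlining_rate(responses: Dict[str, List[int]]) -> float:
    """Share of respondents who gave identical ratings to all matrix items."""
    if not responses:
        return 0.0
    flagged = sum(is_straightlined(r) for r in responses.values())
    return flagged / len(responses)


# Hypothetical data: respondent id -> ratings for five matrix items (1-5 scale).
responses = {
    "r1": [4, 4, 4, 4, 4],
    "r2": [5, 3, 4, 2, 4],
    "r3": [3, 3, 3, 3, 3],
    "r4": [1, 2, 1, 1, 2],
}

flagged_ids = [rid for rid, r in responses.items() if is_straightlined(r)]
rate = straightlining_rate(responses)
```

A flag like this is only a prompt for review. Some respondents genuinely feel the same way about every item, so identical answers alone do not prove a lack of effort.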

6. Data Collection Mode

Some questions may elicit different answers if asked in an online survey, a telephone interview, a paper survey, or a face-to-face interview. While words in phone surveys or in-person interviews are given more importance because of the conversational format, visual design elements have a bigger impact on how questions are read and interpreted in online surveys. Be aware of the types of questions that are a good fit for different survey modes (online, phone, in-person). Population frame, sample design, and data cleaning procedures (during and after data collection) are important aspects in the selection of data collection methods that we need to consider during the survey design phase.

7. Question Format

Questions can be closed-ended or open-ended. Closed-ended questions provide answer choices, while open-ended questions ask respondents to answer in their own words. Each type of question serves different research objectives and has its own limitations.

The key issues here are related to the level of detail and information richness we need, our previous knowledge about the topic, and whether we risk influencing respondents’ answers with the choices we offer.

For example, for closed-ended questions, we need to decide what the answer choices should be and in which order they should appear. This requires we know enough about the topic to provide answer options that capture the information accurately.

8. Question Wording

Formulating a question with the right wording so it accurately reflects the issue of interest is one of the hardest tasks in survey design.

You may have seen political polls getting different answers depending on how a question is crafted. Data errors can creep into a survey design if we use unfamiliar, complex, or technically inaccurate words; ask more than one question at a time; use incomplete sentences; use abstract or vague concepts; make the questions too wordy; or ask questions without a clear task.

Another issue related to question wording is the risk of introducing bias by leading the respondent in a particular direction. A while ago I received a mail survey sponsored by the Republican Party to represent the opinion of voters in my congressional district, and one of the questions was:

“Do you think the record trillion-dollar federal deficit the Democrats are creating with their out-of-control spending is going to have disastrous consequences for our nation?”

Could this question be more biased? The use of adjectives such as “record,” “out-of-control” and “disastrous” makes it really clear what the expected answer is and what the intentions of the study sponsor are.

9. Question Structure

Questions have different parts that must work in harmony to capture high-quality data. These are the question stem (e.g., what is your age?), additional instructions (e.g., select one answer), and response options, if any (e.g., Under 18, 18 to 24, 25+). The wrong combination can leave respondents baffled about how to answer a question. Consider the examples below:

Overlapping answer options

What is your household income? Select one answer.

  1. Under $25,000
  2. $25,000 to $50,000
  3. $50,000 to $75,000
  4. $75,000 +

So, which answer should I choose if my household income is $50,000? Is it option 2 or option 3?
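A check like this can be automated before a survey goes live. The sketch below, in Python, assumes the brackets are listed in ascending order and expressed in whole dollars. The tuple layout, the function name, and the revised brackets are my own illustration, not a standard tool.

```python
from typing import List, Optional, Tuple

# Each bracket is (label, lowest value, highest value); None means open-ended.
Bracket = Tuple[str, Optional[int], Optional[int]]


def find_bracket_problems(brackets: List[Bracket]) -> List[str]:
    """Report overlaps and gaps between consecutive whole-dollar brackets."""
    problems = []
    for (label_a, _, high_a), (label_b, low_b, _) in zip(brackets, brackets[1:]):
        if high_a is None or low_b is None:
            # Only the first lower bound and the last upper bound may be open.
            problems.append(f"open-ended boundary between '{label_a}' and '{label_b}'")
        elif low_b <= high_a:
            problems.append(f"overlap between '{label_a}' and '{label_b}'")
        elif low_b > high_a + 1:
            problems.append(f"gap between '{label_a}' and '{label_b}'")
    return problems


# The answer options from the example above.
original: List[Bracket] = [
    ("Under $25,000", None, 24_999),
    ("$25,000 to $50,000", 25_000, 50_000),
    ("$50,000 to $75,000", 50_000, 75_000),
    ("$75,000 +", 75_000, None),
]

# One possible revision with mutually exclusive brackets.
revised: List[Bracket] = [
    ("Under $25,000", None, 24_999),
    ("$25,000 to $49,999", 25_000, 49_999),
    ("$50,000 to $74,999", 50_000, 74_999),
    ("$75,000 or more", 75_000, None),
]

original_problems = find_bracket_problems(original)
revised_problems = find_bracket_problems(revised)
```

Run against the original options, the check reports two overlaps: one at $50,000 and another at $75,000, which belongs to both option 3 and option 4. The revised brackets pass cleanly.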

Conflict in meaning between different parts of the question

Please indicate the products you use most often. Select all that apply.

  1. Cell phone
  2. Toaster
  3. Microwave oven
  4. Vacuum cleaner

Here the question stem asks for the products used most often, which implies a limited choice, while the instruction invites respondents to select every product they use. Should I check only the one I use the most, or all of them?

10. Survey Flow

Questions should follow a logical flow. Order inconsistencies can confuse respondents and bias the results.

For instance, if you are measuring brand awareness and ask respondents to recognize brands they are familiar with before asking which brands first come to mind, you are rendering the results from the latter question worthless since respondents can’t avoid thinking of brands they just saw in the first question. This seems basic, but it happens.

11. Visual Layout

Using survey design elements in an inconsistent way can increase the burden put on the respondent trying to understand the question’s meaning. For example, encountering different font sizes and colors across questions forces the respondent to relearn their meaning every time they are used.

Also, presenting scales with different directions (positive to negative or vice versa) in rating questions within the same survey increases measurement error as respondents often assume all rating questions have the same scale direction even when the instructions explain the meaning of the endpoints of the scale.

For instance, suppose a preference question using a 1-7 scale, where 1 means “the most preferred,” is followed by an importance question that also uses a 1-7 scale but with a reversed meaning (1 means “the least important”). In such cases, respondents who are not paying attention to the instructions (which is quite common) are likely to assume that the 1 in the importance question means “the most important.”

I have seen many examples of this problem when respondents are asked a follow-up question conditioned on their previous answers and then they realize their mistake and tell us they actually meant to say the opposite.
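When a survey has already been fielded with scales running in different directions, a common analysis-time step is to reverse-code one of them so that higher numbers mean the same thing everywhere. The Python sketch below assumes a 1-7 scale. The function name and the sample answers are hypothetical. Note that this only aligns the numbers for analysis; it cannot correct answers from respondents who misread the scale in the first place.

```python
from typing import List


def reverse_code(value: int, scale_min: int = 1, scale_max: int = 7) -> int:
    """Flip a rating so the low end of the scale becomes the high end."""
    if not scale_min <= value <= scale_max:
        raise ValueError(f"{value} is outside the {scale_min}-{scale_max} scale")
    # On a 1-7 scale: 1 -> 7, 2 -> 6, ..., 7 -> 1; the mid-point stays put.
    return scale_min + scale_max - value


# Hypothetical preference answers where 1 means "the most preferred".
preference: List[int] = [1, 2, 7, 4]

# After reverse-coding, 7 means "the most preferred", matching an importance
# scale on which 7 means "the most important".
preference_aligned = [reverse_code(v) for v in preference]
```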

On Your Way

If you take each of these aspects of survey design into consideration, you will be on your way to creating surveys that produce valid data and can confidently support strategic and tactical decisions for your business.

An earlier version of this article was originally published on July 6, 2010, in the July 2010 issue of Quirk’s Marketing Research Review. The article was last updated and revised on April 6, 2022.