O
Octo
O
Octo
CoursesPricingDashboardPrivacyTerms

© 2026 Octo

Data Skills Essentials
1What Is Data Analytics?2SQL Basics3Data Visualization4Excel & Google Sheets Power User5Data Cleaning & Preparation6Advanced SQL7Power BI Fundamentals8Tableau Fundamentals
Module 4

Excel & Google Sheets Power User

Excel runs the world. Here's how to go from basic user to power user — VLOOKUP, pivot tables, data cleaning tricks, and the formulas that make analysts dangerous.

The intern who replaced a $50,000 consultant

In 2019, a logistics company in Chicago was paying a consulting firm $50,000 per quarter to produce a report. The report showed shipping costs by region, identified late deliveries, and flagged vendors who were overcharging. It took the consultants three weeks to deliver.

A summer intern — a junior in college who had taken one business class — asked to see the raw data. The company gave her access to the same CSV exports the consultants received. In two days, using nothing but Google Sheets, she built a workbook with three tabs: a pivot table summarizing costs by region and vendor, a VLOOKUP that cross-referenced invoices against contracted rates, and conditional formatting that turned overcharges red.

The intern's spreadsheet was more accurate than the consultant's report. It updated instantly when new data was pasted in. And it cost exactly $0.

The company didn't renew the consulting contract.

(Illustrative scenario based on common patterns in business analytics. The leverage of spreadsheet skills over outsourced reporting is well-documented across industries.)

This is what spreadsheet mastery looks like. Not fancy macros or VBA code — just knowing the right 20% of features that solve 80% of problems.

The formula toolkit every analyst needs

You don't need to memorize 400 functions. You need these 12. They cover the vast majority of real-world analysis.

FormulaWhat it doesExample
SUMAdds numbers=SUM(B2:B100)
AVERAGEMean of a range=AVERAGE(B2:B100)
COUNT / COUNTACount numbers / non-empty cells=COUNTA(A2:A100)
IFConditional logic=IF(B2>100,"High","Low")
VLOOKUPFind a value in another table=VLOOKUP(A2,Sheet2!A:C,3,FALSE)
INDEX/MATCHFlexible lookup (replaces VLOOKUP)=INDEX(C:C,MATCH(A2,A:A,0))
SUMIF / COUNTIFConditional sum or count=SUMIF(A:A,"East",B:B)
CONCATENATE / &Join text strings=A2&" "&B2
TEXTFormat numbers as text=TEXT(A2,"$#,##0")
LEFT / RIGHT / MIDExtract parts of text=LEFT(A2,3)
IFERRORHandle errors gracefully=IFERROR(VLOOKUP(...),"Not found")
UNIQUE (Google Sheets)Deduplicate a list=UNIQUE(A2:A100)
🔑The VLOOKUP vs INDEX/MATCH debate
VLOOKUP is easier to learn. INDEX/MATCH is more powerful — it can look left (VLOOKUP can only look right), handles column insertions without breaking, and is faster on large datasets. Learn VLOOKUP first, then switch to INDEX/MATCH. In Google Sheets, XLOOKUP combines the best of both.

VLOOKUP explained like you're 10

Imagine you have a class roster with student names and grades in one binder. A parent calls asking about their kid's grade. VLOOKUP is you flipping to the right page, finding the name in the first column, then sliding your finger right to the grade column.

=VLOOKUP(what_you're_looking_for, where_to_look, which_column_to_return, exact_match)
=VLOOKUP("Sarah Chen", A2:D50, 3, FALSE)
  • what_you're_looking_for — "Sarah Chen"
  • where_to_look — the table range (A2:D50)
  • which_column_to_return — column 3 (the grade column)
  • FALSE — exact match only (almost always what you want)

There Are No Dumb Questions

"Why does my VLOOKUP return #N/A?"

Three common reasons: (1) the value doesn't exist in the lookup table — check for typos and extra spaces. (2) You used TRUE instead of FALSE — TRUE does an approximate match, which is almost never what you want. (3) The lookup column isn't the first column of your range — VLOOKUP only searches left to right. Use INDEX/MATCH to search in any direction.

"Should I learn Google Sheets or Excel?"

Both. They're 95% the same. Google Sheets is better for collaboration and has some modern functions (UNIQUE, FILTER, QUERY). Excel has Power Query, Power Pivot, and better handling of massive datasets. Most analysts switch between them daily.

Pivot tables: the most powerful feature you're not using

A pivot table takes thousands of rows and summarizes them into a meaningful table — in seconds, with no formulas.

Example: You have 50,000 rows of sales data with columns: Date, Region, Product, Salesperson, Revenue. You want to know total revenue by region and product.

Without a pivot table: write SUMIFS formulas for every combination of region and product. Tedious, error-prone, breaks when data changes.

With a pivot table: drag "Region" to rows, "Product" to columns, "Revenue" to values. Done. Three seconds.

Step 1: Select your data (including headers)

Step 2: Insert > Pivot Table (Excel) or Data > Pivot table (Sheets)

Step 3: Drag fields into Rows, Columns, Values, and Filters

Step 4: Change the value aggregation (sum, count, average) as needed

Step 5: Sort, filter, and drill down to explore the data

✗ Without AI

  • ✗Write SUMIF for every combination
  • ✗Manually update when data changes
  • ✗One summary view at a time
  • ✗Takes 30+ minutes to build
  • ✗Breaks when new categories appear

✓ With AI

  • ✓Drag and drop fields
  • ✓Auto-updates with new data
  • ✓Pivot to any view instantly
  • ✓Takes 30 seconds to build
  • ✓Handles new categories automatically

⚡

Pivot Table Practice

25 XP
You have a dataset of 10,000 customer orders with columns: Order_ID, Customer_Name, Region, Product_Category, Order_Date, Revenue. Describe how you'd set up a pivot table to answer each question: 1. "What's the total revenue by region?" → Rows: ___ Values: ___ 2. "What's the average order size by product category?" → Rows: ___ Values: ___ (aggregation: ___) 3. "Which region had the most orders in Q1?" → Rows: ___ Values: ___ Filter: ___ 4. "Show me revenue by region AND product category" → Rows: ___ Columns: ___ Values: ___ *Hint: Rows = what you're grouping by. Values = what you're measuring. Filters = what you're restricting.*

Data cleaning in spreadsheets

Real-world data is messy. Before you analyze anything, you clean it. Here are the techniques that save hours.

The dirty dozen: common data problems

ProblemExampleFix
Extra spaces" John Smith "=TRIM(A2)
Inconsistent case"new york", "New York", "NEW YORK"=PROPER(A2) or =UPPER(A2)
DuplicatesSame order logged twiceRemove Duplicates tool or =UNIQUE()
Mixed date formats"03/15/2025" and "15-Mar-2025"=DATEVALUE() then format consistently
Numbers as text"1,234" stored as text (won't sum)Paste Special > Values, or multiply by 1
Merged cellsHeaders spanning 3 columnsUnmerge and fill down
Typos in categories"Electroncs" vs "Electronics"Find & Replace or VLOOKUP to a clean list
Missing valuesBlank cells scattered throughoutFilter blanks, decide: fill, estimate, or flag

There Are No Dumb Questions

"How much time should I spend cleaning data?"

Professional analysts spend 60-80% of their time on data cleaning and preparation. It's the most important and least glamorous part of the job. A beautiful chart built on dirty data is worse than no chart at all — it gives people false confidence in wrong answers.

"Can I automate data cleaning?"

Yes. Excel's Power Query and Google Sheets' QUERY function can automate repeatable cleaning steps. For one-off projects, manual cleaning is fine. For weekly reports, automate everything.

Conditional formatting: make bad data visible

Conditional formatting highlights cells based on rules. It turns your spreadsheet into a visual dashboard:

  • Highlight duplicates — instantly see repeated entries
  • Color scale on numbers — green for high, red for low
  • Flag blanks — bright yellow background on empty cells
  • Data bars — mini bar charts inside cells

This isn't decoration. It's error detection. When a cell that should contain a dollar amount turns red because it's text, you've found a problem in seconds instead of hours.

⚡

Clean This Dataset

25 XP
You receive a spreadsheet with 5,000 customer records. Before analysis, you spot these issues: - Column A (Name): mix of "john doe", "JOHN DOE", and "John Doe" - Column B (Email): 47 duplicate email addresses - Column C (Phone): some entries have dashes, some have dots, some have parentheses - Column D (Revenue): 12 cells show "$1,234" as text instead of a number - Column E (Date): mix of "MM/DD/YYYY" and "DD-Mon-YYYY" formats Write the formula or describe the process to fix each issue: 1. Standardize names: ___ 2. Find and handle duplicates: ___ 3. Standardize phone format: ___ 4. Convert text to numbers: ___ 5. Standardize dates: ___

Power user techniques

These are the moves that separate beginners from analysts who get things done fast.

Keyboard shortcuts that save hours

ActionWindowsMac
Select entire columnCtrl+SpaceCtrl+Space
Select entire rowShift+SpaceShift+Space
Fill downCtrl+DCmd+D
Go to last cell in rangeCtrl+EndCmd+End
Insert new rowCtrl+Shift+=Cmd+Shift+=
Find and ReplaceCtrl+HCmd+H
Paste Values onlyCtrl+Shift+VCmd+Shift+V
Absolute reference ($)F4Cmd+T
⚠️The Paste Values trap
When you copy a cell with a formula and paste it, the formula adjusts its references. This is usually helpful — unless you're pasting into a different context. When you want just the result, not the formula, always Paste Values. This is the #1 source of spreadsheet errors in business.

Named ranges

Instead of =SUM(Sheet2!B2:B5000), create a named range called Revenue and write =SUM(Revenue). Your formulas become self-documenting and don't break when rows are added.

Data validation

Restrict what can be entered in a cell: dropdowns for categories, number ranges for amounts, date ranges for dates. This prevents dirty data at the source — far better than cleaning it after.

750M+people use Excel worldwide (Microsoft estimate)

80%of businesses rely on spreadsheets for planning (surveys vary; directional)

88%of spreadsheets contain errors (University of Hawaii study, Ray Panko)

That last stat should terrify you. Nearly 9 in 10 spreadsheets have errors. Data validation, named ranges, and clean structure are how you avoid being part of that statistic.

Key takeaways

  • 12 formulas cover 80% of analysis: SUM, AVERAGE, COUNT, IF, VLOOKUP, INDEX/MATCH, SUMIF, COUNTIF, CONCATENATE, TEXT, LEFT/RIGHT/MID, IFERROR
  • Pivot tables are the most powerful spreadsheet feature. They replace dozens of formulas with drag-and-drop summarization. Learn them before anything else.
  • Data cleaning is the job. TRIM, PROPER, Remove Duplicates, and conditional formatting are your first line of defense against bad data.
  • VLOOKUP finds values across tables. Learn it, then graduate to INDEX/MATCH for flexibility. Always use FALSE for exact match.
  • Power user habits compound: keyboard shortcuts, named ranges, data validation, and Paste Values save minutes per day that add up to weeks per year.

?

Knowledge Check

1.You use =VLOOKUP(A2, Sheet2!A:D, 3, TRUE) and get an incorrect result. What is the most likely cause?

2.What is the primary advantage of a pivot table over writing SUMIF formulas for every combination of categories?

3.A column contains dollar values like '$1,234.56' that won't sum because they are stored as text. What is the most reliable way to fix this?

4.Why is Paste Values (instead of regular Paste) critical when copying formula results to another location?

Previous

Data Visualization

Next

Data Cleaning & Preparation