Prepared By : Prof. Uday Shah (HOD-IT)
Working with Data Sources in Tableau
1. Tableau – Connect to a Data Source
-
Tableau allows you to connect to multiple data sources like Excel, CSV, SQL Server, Google Sheets, etc.
-
From the Start Page, click on Connect in the left sidebar.
-
Select the type of data you want to connect (e.g., Excel, Text, Server).
-
Once selected, browse your file location or enter server credentials.
-
Tableau loads the data into the Data Source Tab.
-
You can preview data in a grid-like format before importing it.
-
You can rename fields directly here for clarity.
-
Multiple tables from the same file can be loaded.
-
Drag tables into the canvas to define relationships.
-
Data can be Live Connected or Extracted for performance improvement.
-
Live Connection keeps data updated from source.
-
Extract stores a snapshot for faster processing.
2. Connect with Text File
-
Used to connect .txt or .csv files.
-
In Tableau, click Connect → Text File.
-
Select your CSV or TXT file.
-
Tableau automatically detects delimiters (comma, tab, pipe).
-
You can change delimiter type in connection settings.
-
Supports UTF-8 encoding.
-
Header rows can be detected automatically.
-
Data preview is shown in the Data Source window.
-
Column names and data types are inferred but editable.
-
Very useful for importing large exported reports.
-
Can be combined with joins if other sources are connected.
-
Data Extracts can speed up processing for large text files.
3. Connect with Excel
-
Most common data connection type in Tableau.
-
Supports
.xls
and.xlsx
formats. -
Go to Connect → Microsoft Excel.
-
Select the file from your computer.
-
All sheets are displayed on the left.
-
Drag sheets into the canvas to load them.
-
Multiple sheets can be joined/related.
-
Data can be pivoted within Tableau.
-
Field names and data types are inferred automatically.
-
Can connect with named ranges in Excel.
-
Large Excel files might need extracts for speed.
-
You can refresh Excel data without re-importing.
4. Tableau Extracting Data
-
Extracting creates a static snapshot of your data.
-
Improves performance with large datasets.
-
File is stored as
.hyper
format in Tableau. -
Supports filters to limit extracted data.
-
Extracts can be refreshed manually or scheduled (Tableau Server).
-
Allows offline analysis without original data connection.
-
Reduces load time for dashboards.
-
Enables better aggregation performance.
-
Extracted data can still be filtered in Tableau.
-
Can be optimized for faster query execution.
-
Extracts can be published and shared.
5. Data Connecting in Tableau
-
Tableau supports Live and Extract connections.
-
Live connection always fetches the latest data from the source.
-
Extract connection stores a cached version.
-
You can connect to multiple data sources simultaneously.
-
Tableau allows blending/joining between different sources.
-
Connection type affects performance.
-
You can switch between Live and Extract anytime.
-
Live is better for real-time dashboards.
-
Extract is better for speed and offline use.
-
Secure connections can be set up for databases.
-
Data refresh schedules are available in Tableau Server/Online.
6. Tableau Editing Metadata
-
Metadata includes field names, data types, and descriptions.
-
You can rename fields for better readability.
-
Change data types (String, Date, Number).
-
Hide unnecessary fields.
-
Create calculated fields.
-
Change default aggregation (SUM, AVG, COUNT).
-
Set geographic roles for location-based fields.
-
Add comments for future reference.
-
Group fields into folders.
-
Metadata changes don’t affect the original source.
-
Can be reset to original source state.
7. Tableau Data Joining
-
Joins combine tables within the same data source.
-
Supports Inner, Left, Right, Full Outer joins.
-
Join keys define how tables match.
-
Can join multiple tables together.
-
Joins happen at row level.
-
Useful when tables share common fields.
-
Join performance depends on data size.
-
Data preview shows join results instantly.
-
Multiple join clauses are possible.
-
Avoid unnecessary joins to improve speed.
8. Tableau Data Blending
-
Combines data from different data sources.
-
Requires a primary and secondary data source.
-
Linking is done on common fields.
-
Primary source fields have blue check marks.
-
Secondary source fields have orange check marks.
-
Useful for combining different database types.
-
Works at aggregated level (not row level like joins).
-
Blending is slower than joining in same source.
-
Blended fields can be used in same visualization.
-
Limited by aggregation differences between sources.
9. Tableau Replacing Data Source
-
Allows switching one data source with another.
-
Used when data source location changes.
-
Keeps existing visualizations intact.
-
Both data sources must have matching field names/types.
-
Avoids rebuilding dashboards from scratch.
-
Useful for moving from development to production.
-
Can replace Excel with SQL for performance.
-
Tableau automatically maps matching fields.
-
Non-matching fields need manual remapping.
-
Old data source is removed after replacement.
:: Best of Luck ::