This interface validates social media data for the FIFA Club World Cup 2025 Canary FPP study, which analyzes Coca-Cola's digital marketing presence across Brazil, Mexico, and the United States.
- Open your web browser
- Navigate to the provided URL (local, network, or public)
- Enter login credentials:
- Username: fifa_research
- Password: CanaryFPP2025!
- File Format: Excel (.xlsx or .xls)
- Sheet Name: coded data (preferred), or use the default first sheet
- Data Structure: Must follow Canary FPP Data Template 2025 v1
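The sheet rule above (prefer the sheet named coded data, otherwise fall back to the first sheet) could be sketched roughly as follows. The function name `choose_sheet` is hypothetical and this is not the tool's actual code; the list of sheet names is assumed to come from whatever Excel reader is in use (for example, openpyxl exposes it as `Workbook.sheetnames`).

```python
def choose_sheet(sheet_names, preferred="coded data"):
    """Return the preferred sheet if present, else the first sheet.

    Hypothetical helper: illustrates the selection rule only.
    """
    if not sheet_names:
        raise ValueError("Workbook contains no sheets")
    if preferred in sheet_names:
        return preferred
    # Fall back to the default first sheet
    return sheet_names[0]


print(choose_sheet(["Sheet1", "coded data"]))
print(choose_sheet(["raw", "notes"]))
```

The match here is exact and case-sensitive, which is why the sheet should be named in lowercase.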
Your Excel file should include these columns (snake_case format):
id, date, month, time, media_type, social_platform, url, profile_name,
username, fifa_content, caption_title, english_caption, sentiment,
country, language, likes, comments, shares, views, engagement,
marketing_tactic, message_framing, product_type, account_type,
company, product_brand
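To pre-check a file before upload, the column list above can be compared with the header row of your sheet. The following is a minimal sketch, assuming the header has already been read into a list of strings; `compare_columns` is a hypothetical name and the tool's own check may differ.

```python
# Expected columns, copied from the list above
REQUIRED_COLUMNS = [
    "id", "date", "month", "time", "media_type", "social_platform", "url",
    "profile_name", "username", "fifa_content", "caption_title",
    "english_caption", "sentiment", "country", "language", "likes",
    "comments", "shares", "views", "engagement", "marketing_tactic",
    "message_framing", "product_type", "account_type", "company",
    "product_brand",
]


def compare_columns(actual):
    """Return (missing, extra) column names relative to REQUIRED_COLUMNS."""
    expected = set(REQUIRED_COLUMNS)
    present = set(actual)
    missing = [c for c in REQUIRED_COLUMNS if c not in present]
    extra = [c for c in actual if c not in expected]
    return missing, extra


missing, extra = compare_columns(["id", "date", "Social Platform"])
print("Missing:", missing)
print("Extra:", extra)
```

A header such as "Social Platform" shows up as extra while social_platform shows up as missing, which is the pattern to look for in the missing/extra columns report.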
Countries:
- Brazil, Mexico, United States
Social Platforms:
- Facebook, Instagram, Twitter/X, YouTube, TikTok
Marketing Tactics (11 categories):
- Direct product advertising
- Price promotion
- Contest and competition
- Events, occasions, and sponsorships
- General PR
- Corporate social responsibility
- Brand extension
- Non-brand extended product
- Surrogate marketing
- Community-based marketing
- Cobranding
Message Framing (12 types):
- Community celebrations
- Entertainment/Fun
- Environment eco-awareness
- Glamorization
- Health claims
- Informational
- Personal care and wellness
- Product features
- Social welfare
- Passion and emotion
- Socialisation/community
- Athletic performance and physicality, aspiration
- Click "Select Excel File"
- Choose your data file from your computer
- File will be validated for format and structure
- Click "🚀 Start FIFA 2025 Validation"
- Watch the progress bar (typically 1-3 minutes)
- Processing includes:
- Column structure verification
- Data quality assessment
- Auto-correction application
- Validation against study protocol
Navigate through the result tabs:
📈 Executive Summary
- Before/after quality scores
- Total records processed
- Auto-corrections applied
- Overall processing status
🔍 Validation Summary
- Detailed compliance metrics
- Column-by-column analysis
- Missing/extra columns report
🛠️ Auto-Corrections
- Complete log of changes made
- Row-by-row correction details
- Fuzzy matching results
📊 Visual Analytics
- Interactive overview dashboard
- Timeline analysis by country
- Company & brand performance charts
- Click "📥 Download Comprehensive Report"
- Receive Excel file with multiple sheets:
- processed_data - Your cleaned dataset
- original_raw_data - Unmodified input
- executive_summary - Quality metrics
- changes_made - Auto-correction log
- Additional validation sheets
Fuzzy String Matching (96% threshold):
- Marketing tactics typos → Correct categories
- Message framing errors → Valid types
- Product type mistakes → Proper classifications
- Platform name variations → Standard names
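As an illustration of threshold-based fuzzy correction, here is a minimal sketch using Python's standard `difflib`. This is an assumption for demonstration: the document does not say which matching library or similarity measure the tool uses, so scores from `SequenceMatcher.ratio()` may not equal the tool's 96% scale. `fuzzy_correct` is a hypothetical name.

```python
from difflib import SequenceMatcher

# Valid marketing tactic categories, copied from the list earlier in this guide
MARKETING_TACTICS = [
    "Direct product advertising", "Price promotion",
    "Contest and competition", "Events, occasions, and sponsorships",
    "General PR", "Corporate social responsibility", "Brand extension",
    "Non-brand extended product", "Surrogate marketing",
    "Community-based marketing", "Cobranding",
]


def fuzzy_correct(value, valid_values, threshold=0.96):
    """Return the best-matching valid value, or None if below threshold."""
    cleaned = value.strip().lower()
    best, best_score = None, 0.0
    for candidate in valid_values:
        score = SequenceMatcher(None, cleaned, candidate.lower()).ratio()
        if score > best_score:
            best, best_score = candidate, score
    return best if best_score >= threshold else None


print(fuzzy_correct("Direct product advertisng", MARKETING_TACTICS))
print(fuzzy_correct("Discounts", MARKETING_TACTICS))
```

With a threshold this strict, only near-identical strings (a dropped or swapped letter) are corrected; anything further from a valid category is left unmatched and needs manual review.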
Date Standardization:
- Various formats → YYYY-MM-DD standard
- Invalid dates → Flagged for review
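The date rule above could look roughly like the sketch below. The list of accepted input formats is an assumption (the document does not enumerate them), and ambiguous day/month orderings are resolved here as day-first, which may not match the tool's behavior.

```python
from datetime import datetime

# Assumed input formats, tried in order; not taken from the tool itself
CANDIDATE_FORMATS = ["%Y-%m-%d", "%Y/%m/%d", "%d/%m/%Y", "%d-%m-%Y", "%d.%m.%Y"]


def standardize_date(raw):
    """Return the date as YYYY-MM-DD, or None to flag it for review."""
    text = str(raw).strip()
    for fmt in CANDIDATE_FORMATS:
        try:
            return datetime.strptime(text, fmt).strftime("%Y-%m-%d")
        except ValueError:
            continue  # try the next format
    return None


for value in ["15/06/2025", "2025/06/15", "2025-02-30", "mid June"]:
    print(value, "->", standardize_date(value))
```

Impossible dates such as 2025-02-30 fail every format and come back as None, which corresponds to the "flagged for review" outcome.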
Engagement Calculation:
- Recalculated as: likes + comments + shares + views
- Corrects mathematical errors automatically
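The engagement formula above can be expressed as a small row-level function. This is a sketch: treating a missing or empty count as 0 is an assumption, and `recalculate_engagement` is a hypothetical name.

```python
def recalculate_engagement(row):
    """Return (corrected_row, was_changed) using likes+comments+shares+views."""
    parts = ("likes", "comments", "shares", "views")
    # Assumption: missing or empty counts are treated as 0
    total = sum(int(row.get(key) or 0) for key in parts)
    was_changed = row.get("engagement") != total
    return {**row, "engagement": total}, was_changed


row = {"likes": 10, "comments": 2, "shares": 3, "views": 100, "engagement": 15}
fixed, changed = recalculate_engagement(row)
print(fixed["engagement"], changed)
```

Because views are part of the sum, a row whose engagement was entered as likes + comments + shares only will be reported as changed.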
Language Correction:
- Brazil → Portuguese/English only
- Mexico → Spanish/English only
- United States → English/Spanish only
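The country-language rules above amount to a lookup table. The sketch below only checks compatibility; how the tool resolves a mismatch is not described here, so no correction is attempted. `language_is_valid` is a hypothetical name.

```python
# Allowed languages per country, copied from the rules above
ALLOWED_LANGUAGES = {
    "Brazil": {"Portuguese", "English"},
    "Mexico": {"Spanish", "English"},
    "United States": {"English", "Spanish"},
}


def language_is_valid(country, language):
    """Return True if the language is allowed for the given study country."""
    allowed = ALLOWED_LANGUAGES.get(country.strip())
    return allowed is not None and language.strip() in allowed


print(language_is_valid("Brazil", "Portuguese"))
print(language_is_valid("Brazil", "Spanish"))
```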
Column Standardization:
- Mixed case → snake_case format
- Spaces/hyphens → underscores
- Per Data Template specification
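A column-name normalizer along these lines would cover the cases listed above. It is a sketch, not the tool's implementation; in particular, splitting camelCase names such as ProductType is an assumed reading of "mixed case".

```python
import re


def to_snake_case(name):
    """Convert a column header to snake_case."""
    text = name.strip()
    # Insert an underscore at lowercase-to-uppercase boundaries (camelCase)
    text = re.sub(r"(?<=[a-z0-9])(?=[A-Z])", "_", text)
    # Replace runs of spaces or hyphens with a single underscore
    text = re.sub(r"[\s\-]+", "_", text)
    # Collapse repeated underscores and trim any at the ends
    text = re.sub(r"_+", "_", text)
    return text.lower().strip("_")


for header in ["Social Platform", "message-framing", "ProductType"]:
    print(header, "->", to_snake_case(header))
```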
- Before Score: Percentage of valid rows in original data
- After Score: Percentage of valid rows after corrections
- Improvement: Difference between before and after scores
- 85%+ Quality Score: Ready for analysis
- 70-84% Quality Score: Manual review recommended
- <70% Quality Score: Significant issues, contact research team
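The scores and bands above can be worked through with a short example. The row counts below are invented for illustration, and the function names are hypothetical; scores between 84% and 85% are treated here as falling in the manual-review band.

```python
def quality_score(valid_rows, total_rows):
    """Percentage of valid rows; 0.0 for an empty dataset."""
    return 0.0 if total_rows == 0 else 100.0 * valid_rows / total_rows


def interpret(score):
    """Map a quality score to the guidance bands listed above."""
    if score >= 85:
        return "Ready for analysis"
    if score >= 70:
        return "Manual review recommended"
    return "Significant issues, contact research team"


# Illustrative numbers only: 1000 rows, 640 valid before, 910 valid after
before = quality_score(640, 1000)
after = quality_score(910, 1000)
print(f"Before {before:.1f}%, after {after:.1f}%, improvement {after - before:.1f}")
print(interpret(after))
```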
- Missing required fields
- Invalid categorical values
- Country-language mismatches
- Incorrect engagement calculations
- Invalid URLs or usernames
- Wrong date formats
- Check File Format: Ensure .xlsx or .xls format
- Verify Sheet Name: Use 'coded data' if possible
- Review Columns: Match expected column names
- Sample First: Test with small file (100-500 rows)
- Wait for Completion: Don't refresh browser during processing
- Monitor Progress: Watch progress bar and status messages
- Check for Errors: Review any error messages carefully
- Review All Tabs: Check executive summary, corrections, and charts
- Verify Changes: Review auto-corrections log
- Download Report: Save comprehensive Excel report
- Submit Clean Data: Use 'processed_data' sheet for analysis
"File format not supported"
- Solution: Ensure file is .xlsx or .xls format
- Convert CSV files to Excel before upload
"Sheet 'coded data' not found"
- Solution: Rename your data sheet to 'coded data' (lowercase)
- Or ensure data is in the first sheet
"Column structure mismatch"
- Solution: Review column names against expected list
- Use snake_case format (e.g., 'social_platform' not 'Social Platform')
"Processing failed"
- Check: File isn't corrupted or password-protected
- Verify: Data contains expected columns
- Ensure: File size is reasonable (<100MB)
"Low quality score"
- Review: Auto-corrections log for issues
- Check: Country-language compatibility
- Verify: Required fields aren't empty
"Can't reach interface"
- Check: Correct URL and port (default 7860)
- Verify: Network connection
- Try: Different browser or incognito mode
"Login failed"
- Username: fifa_research (lowercase)
- Password: CanaryFPP2025! (exact case)
- Country Distribution: Pie chart of post distribution
- Social Platforms: Bar chart of platform usage
- Marketing Tactics: Distribution of promotional strategies
- Message Framing: Types of content messaging
- Product Types: Coca-Cola product categories
- Account Types: Social media account classifications
- Daily Posts: Posting patterns during FIFA Club World Cup
- Country Trends: Geographic posting behaviors
- Interactive: Hover for details, zoom to specific periods
- Company Distribution: Brand presence analysis
- Product Brands: Specific product marketing
- Geographic Breakdown: Brand performance by country
- Clean Column Names: Use standard naming before upload
- Check Required Fields: Ensure critical columns have data
- Validate Dates: Use consistent date format
- Review Categories: Match expected values where possible
- Start Small: Test with subset before processing full dataset
- Review Corrections: Understand what gets changed automatically
- Manual Review: Check high-impact corrections manually
- Iterative Process: Re-upload with fixes if needed
- Share URLs: Use network/public URL for team access
- Standard Process: Establish consistent workflow
- Documentation: Keep log of processing dates and versions
- Quality Control: Have multiple team members review results
- Interface issues, login problems, upload errors
- Contact: [Your technical contact]
- Category classifications, auto-corrections, quality scores
- Reference: Canary FPP Study Protocol documentation
- Contact: [Your research team contact]
- Methodology, validation standards, reporting requirements
- Reference: Data Template and Codebook 2025 v1
- Contact: [Principal investigator contact]
- Canary FPP Study Protocol: [Link to documentation]
- Data Template 2025 v1: [Link to template]
- GitHub Repository: [Link to code repository]
- Technical Documentation: [Link to detailed docs]
Remember: This tool is designed to improve data quality, not replace careful data collection. Always review auto-corrections and maintain awareness of your data's context and meaning.