Longitudinal sample of 150 users over 30 days.
High-level snapshot (46 users)
Metric | Value | What it tells us |
---|---|---|
Total events | 2 670 | 30-day sample size. |
Median events / user | 16 | Half the users log 16 or fewer scans in 30 days, i.e. at most about one scan every other day. |
Median duplication ratio | 0.50 | For the median user, one in two scans is an exact repeat of something they already logged. |
Users with dup-ratio > 0.60 | 4 (9 %) | Extreme repeaters. |
Users with dup-ratio < 0.20 | 4 (9 %) | High-variety “explorers”. |
Median semantic similarity | ≈ 0.34 | Food names per user are moderately alike (lots of variants on the same staples). |
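
The report does not spell out how the duplication ratio is computed. The segment rows are consistent with `1 - unique names / total events` (for example, 133 unique foods in 193 events gives about 0.31, and four scans of one item give 0.75), so the sketch below assumes that definition. The function names and the "typical" label are illustrative, not taken from the original analysis; the 0.60 and 0.20 cut-offs are the ones used in the table above.

```python
def duplication_ratio(food_names):
    """Share of a user's events that repeat an already-logged name.

    Assumed definition (inferred from the report's figures):
    1 - unique names / total events, using exact string matches.
    """
    if not food_names:
        return 0.0
    return 1 - len(set(food_names)) / len(food_names)


def segment(dup_ratio, high=0.60, low=0.20):
    """Bucket a user by the report's thresholds (labels are illustrative)."""
    if dup_ratio > high:
        return "extreme repeater"
    if dup_ratio < low:
        return "explorer"
    return "typical"


# Four scans of the same item, as in the single-food binge row.
binge = ["Goldfish Crackers"] * 4
print(duplication_ratio(binge), segment(duplication_ratio(binge)))
```

Because the match is on exact strings, variants such as “Steak” and “Beef Steak” count as different foods under this definition.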
Notable segments
Segment | Example ID | Events | Dup-ratio | Semantic sim | Staples |
---|---|---|---|---|---|
Ultra-repeat power logger | vJd5w8jr… | 310 | 0.65 | 0.35 | 27× “Waffle”, 24× “Spaghetti + Beef”, plus pizza & nuggets. Breakfast/dinner on auto-pilot. |
Routine breakfasters | Mlmd…4j2 (from earlier run) | 55 | 0.62 | 0.47 | Scrambled eggs, bacon, avocado variants. High semantic overlap → same meal, tiny tweaks. |
Protein-and-fruit snackers | Bv8LBjMj… | 90 | 0.50 | 0.34 | Blueberries, protein yogurt/whey, salads—healthy repetitive snacking. |
Low-variety low-volume | iarW…TK2 | 2 | 0.50 | 0.00 | Logs one frozen meal twice—likely feature test or diet plan. |
Explorers / foodies | TA4nY7Vb… | 193 | 0.31 | 0.37 | 133 unique foods; repeats mainly “Breakfast Plate” & bananas—otherwise very diverse. |
Synonym duplicates (very high semantic sim) | RzbkvID… | 4 | 0.50 | 0.83 | “Steak” vs “Beef Steak” → data normalisation opportunity. |
Single-food binge | TVaxCTpn… | 4 | 0.75 | 0.00 | 4 scans of “Goldfish Crackers” in a row—temporary craving or barcode test. |
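
The synonym-duplicates row suggests flagging name pairs that are close but not identical. The sketch below is one hedged way to do that with plain string similarity from Python's standard-library `difflib`. It is a stand-in, not the semantic-similarity method behind the table (which the report does not describe), and the 0.6 threshold and the function name are assumptions chosen for illustration.

```python
from difflib import SequenceMatcher
from itertools import combinations


def near_duplicate_names(names, threshold=0.6):
    """Return (name_a, name_b, score) for pairs of distinct names that look alike.

    Uses character-level similarity, not embeddings; threshold is an assumption.
    """
    # Lower-case and strip so that casing and stray spaces are not counted as differences.
    unique = sorted({n.strip().lower() for n in names if n and n.strip()})
    pairs = []
    for a, b in combinations(unique, 2):
        score = SequenceMatcher(None, a, b).ratio()
        if score >= threshold:
            pairs.append((a, b, round(score, 2)))
    return pairs


sample = ["Steak", "Beef Steak", "Blueberry", "Blueberries", "Waffle"]
for a, b, score in near_duplicate_names(sample):
    print(a, "~", b, score)
```

Pairs flagged this way would still need a human or rule-based check before merging, since string closeness does not guarantee the foods are the same.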
Key takeaways
- 50 % literal repetition is the norm. Predictive fill-in or “log again?” prompts should improve the experience for the majority of users.
- Some foods have high semantic similarity but different names (e.g., “Steak” vs “Beef Steak”, “Blueberry” vs “Blueberries”), which makes them candidates for name normalisation.
- Extreme repeaters (> 0.6 dup-ratio) make up ~10 %.
- Explorers (< 0.2 dup-ratio) are equally common.
- Data quality flags
  - Blank `food_name` entries (appearing twice)
  - Generic labels like “Food Plate”, “Breakfast Plate” still frequent.
- Most-logged staples across users
  - Eggs / breakfast plates
  - Bananas & other single fruits
  - Chicken (sandwiches, breasts, bowls)
  - Protein shakes / bars
  - Simple carbs (waffles, pancakes, pasta, rice)
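
As a rough illustration of the “log again?” idea from the takeaways, the sketch below picks a user's most frequently repeated foods while skipping the entries flagged as data-quality problems (blank names and generic plate labels). The function name, the `top_n` and `min_count` parameters, and the exact contents of `GENERIC_LABELS` are assumptions for illustration, not part of the original analysis.

```python
from collections import Counter

# Generic labels named in the data-quality flags; extend as needed (assumption).
GENERIC_LABELS = {"food plate", "breakfast plate"}


def log_again_candidates(food_names, top_n=3, min_count=2):
    """Suggest up to top_n foods the user has logged at least min_count times."""
    counts = Counter(
        name.strip()
        for name in food_names
        # Skip blank/None names and generic labels, which are poor suggestions.
        if name and name.strip() and name.strip().lower() not in GENERIC_LABELS
    )
    return [name for name, count in counts.most_common(top_n) if count >= min_count]


history = ["Waffle"] * 3 + ["Spaghetti + Beef"] * 2 + ["", "Food Plate", "Food Plate", "Pizza"]
print(log_again_candidates(history))
```

Filtering before counting keeps a frequently used but uninformative label such as “Food Plate” from crowding out a real staple.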