We were lucky to have Drew Conway, Head of Data Science at Two Sigma Real Estate’s private investment platform, as our keynote speaker for VTS Accelerate 2024.
Conway shared his insights into the integration of data science and machine learning within the real estate sector. His talk focused on Two Sigma’s approach to leveraging technology for data-supported investment decisions. Check out our recap below to learn his main points.
Integrating Data Science with Real Estate Expertise
The core of Conway's presentation revolved around Two Sigma Real Estate's approach to integrating data science and machine learning with traditional commercial real estate expertise. The goal is not just to collect data but to develop proprietary metrics and models that provide accurate, actionable insights.
Two Sigma Real Estate uses a variety of data sources, including:
- Market trends and economic indicators.
- Property-specific information.
- Aggregated data to reveal patterns invisible to traditional analysis.
The data science team works closely with investment professionals to ensure these models are practical and useful during various stages of the investment process.
Ensuring the Integrity and Quality of Data
Data integrity is critical, especially in real estate’s notoriously "messy" data environment. Conway shared an experiment comparing reported rent growth across multiple third-party vendors, revealing significant variances.
Transparency and Backtesting
To solve this, Two Sigma developed property-level statistics to pinpoint the strengths and weaknesses of different data sets.
- Transparency: "If your investment team doesn’t have a deep understanding of not only the data that went into a model but how that model was fit, they're not going to trust it," Conway explained.
- Backtesting: By comparing past predictions with actual results, the team can either stand by their data or adjust their models for higher accuracy.
Applying Models to Real-World Investments
With data science-supported insights, investment teams can focus on high-potential opportunities. Conway highlighted the link between consumer behavior and rent growth:
"[What] was interesting was the amount of rent growth that we could measure directly in markets that were exposed to high degrees of durable consumer goods spending.”
This allows Two Sigma Real Estate to target investments in areas with strong consumer spending trends with much higher precision.
Balancing Accuracy and Stability in Predictions
A major challenge in predictive modeling is preventing "model drift" from confusing investment teams during a transaction.
- Ensemble Modeling: Two Sigma addresses this by combining multiple models to improve stability.
- Building Trust: This methodology ensures that predictions remain reliable and consistent, helping investment professionals rely on the data throughout the life of a deal.
Putting the Tool into Practice
One of Two Sigma’s internal tools offers real-time analytics and neighborhood-level visualizations. This platform allows data scientists and real estate professionals to collaborate on due diligence, spot opportunities, and generate customized reports in an intuitive interface.
The Impact of Data Science in Real Estate Investing
Conway concluded by outlining four key benefits of a data science-driven approach:
- Enhanced Tactical and Strategic Decisions: Combining top-down macro knowledge with bottom-up data.
- Conviction to Act with Precision: Increased confidence in informed decision-making.
- Native Scale for Opportunity Identification: Engineered systems that identify deals at a rapid pace.
- Risk-Adjusted Enhanced Returns: Achieving better outcomes through machine learning.
Two Sigma Sets a New Benchmark
Two Sigma Real Estate’s approach provides a roadmap for the future of the industry. Investors who embrace data science and machine learning in real estate will be better positioned to navigate the market and achieve superior portfolio outcomes.
FAQs
1. Why is data integrity a challenge in real estate data science?
Real estate data is often fragmented across multiple third-party vendors. As Drew Conway demonstrated, rent growth data can vary significantly between sources, making it difficult to gain conviction without proprietary filtering and backtesting to verify accuracy.
2. What is "Ensemble Modeling," and why does it matter?
Ensemble modeling combines multiple predictive models into one to improve stability. This prevents the "confusing" fluctuations that can occur when using a single model, ensuring that investment teams have consistent, reliable data during a transaction.
3. How does consumer spending data impact office or residential investments?
Two Sigma found a direct correlation between durable consumer goods spending and rent growth in specific markets. By analyzing these "bottom-up" consumer patterns, investors can predict which neighborhoods or regions are likely to outperform before traditional market reports catch up.potential for data science in real estate investment is limitless.


