Sample Database

MindsDB provides a read-only PostgreSQL database pre-loaded with various datasets to help you test and explore the capabilities of Minds. These datasets are curated to cover a wide range of scenarios and use cases, allowing you to experiment with different data types and analysis techniques.

The database is publicly accessible and intended for testing and experimentation only. With these datasets, you can get started with Minds quickly, see how it works, and explore how it applies to real-world problems.

Connection

To connect to our read-only PostgreSQL database and access the example datasets, use the following connection parameters:

from minds.datasources import DatabaseConfig  # import path assumed from the Minds Python SDK

demo_config = DatabaseConfig(
    description='House Sales Data',
    type='postgres',
    connection_args={
        'user': 'demo_user',
        'password': 'demo_password',
        'host': 'samples.mindsdb.com',
        'port': '5432',
        'database': 'demo',
        'schema': 'demo_data'
    },
    tables=['house_sales']
)
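
The same parameters can also be used with any standard PostgreSQL client. The following is a minimal sketch, not part of the Minds SDK: the helper name `build_postgres_uri` is hypothetical, and it assumes the client accepts libpq-style connection URIs, where the schema is selected by passing `search_path` through the `options` parameter.

```python
from urllib.parse import quote, urlencode

# Same values as the connection parameters above.
connection_args = {
    'user': 'demo_user',
    'password': 'demo_password',
    'host': 'samples.mindsdb.com',
    'port': '5432',
    'database': 'demo',
    'schema': 'demo_data',
}


def build_postgres_uri(args):
    """Build a libpq-style connection URI from a connection_args dict.

    Hypothetical helper for illustration; it is not part of the Minds SDK.
    """
    # Percent-encode credentials so special characters cannot break the URI.
    user = quote(args['user'], safe='')
    password = quote(args['password'], safe='')
    # Set search_path so unqualified table names resolve to the demo schema.
    query = urlencode(
        {'options': f"-c search_path={args['schema']}"},
        quote_via=quote,
    )
    return (
        f"postgresql://{user}:{password}@{args['host']}:{args['port']}"
        f"/{args['database']}?{query}"
    )


uri = build_postgres_uri(connection_args)
print(uri)
```

The resulting URI can be passed to a client such as `psql`. Because the database is read-only, only `SELECT` statements are expected to succeed.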

Below is the list of all available datasets, each exposed as a table.

Data Tables

  • Name
    fraud_detection
    Description

    This dataset contains simulated mobile money transactions. Each value of step represents one hour of the simulation.

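    | step | type | amount | nameOrig | oldbalanceOrg | newbalanceOrig | nameDest | oldbalanceDest | newbalanceDest | isFraud | isFlaggedFraud |
    |---|---|---|---|---|---|---|---|---|---|---|
    | 1 | PAYMENT | 9839.64 | C1231006815 | 170136.0 | 160296.36 | M1979787155 | 0.0 | 0.0 | 0 | 0 |
    | 1 | PAYMENT | 1864.28 | C1666544295 | 21249.0 | 19384.72 | M2044282225 | 0.0 | 0.0 | 0 | 0 |
    | 1 | TRANSFER | 181.0 | C1305486145 | 181.0 | 0.0 | C553264065 | 0.0 | 0.0 | 1 | 0 |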

    To learn more about this dataset, please visit this link.

  • Name
    customer_churn
    Description

    This dataset contains customer churn data for a U.S. bank.

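    | RowNumber | CustomerId | Surname | CreditScore | Geography | Gender | Age | Tenure | Balance | NumOfProducts | HasCrCard | IsActiveMember | EstimatedSalary | Exited |
    |---|---|---|---|---|---|---|---|---|---|---|---|---|---|
    | 1 | 15634602 | Hargrave | 619 | France | Female | 42 | 2 | 0 | 1 | 1 | 1 | 101348.88 | 1 |
    | 2 | 15647311 | Hill | 608 | Spain | Female | 41 | 1 | 83807.86 | 1 | 0 | 1 | 112542.58 | 0 |
    | 3 | 15619304 | Onio | 502 | France | Female | 42 | 8 | 159660.8 | 3 | 1 | 0 | 113931.57 | 1 |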

    To learn more about this dataset, please visit this link.

  • Name
    customer_support_chat
    Description

    This dataset contains customer support chat data, which can be used to train Large Language Models (LLMs) such as GPT, Llama2, and Falcon, for both fine-tuning and domain adaptation.

    | flags | instruction | category | intent | response |
    |---|---|---|---|---|
    | B | question about cancelling order Order Number | ORDER | cancel_order | I've understood you have a question regarding canceling order Order Number, and I'm here to provide you with the information you need. Please go ahead and ask your question, and I'll do my best to assist you. |
    | BQZ | i have a question about cancelling order Order Number | ORDER | cancel_order | I've been informed that you have a question about canceling order Order Number. I'm here to assist you! Please go ahead and let me know what specific question you have, and I'll provide you with all the information and guidance you need. Your satisfaction is my top priority. |

    To learn more about this dataset, please visit this link.

  • Name
    bank_customer_transactions
    Description

    This dataset contains customer transactions along with demographic and shopping behavior information.

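    | Customer ID | Name | Surname | Gender | Birthdate | Transaction Amount | Date | Merchant Name | Category |
    |---|---|---|---|---|---|---|---|---|
    | 752858 | Sean | Rodriguez | F | 2002-10-20 | 35.47 | 2023-04-03 | Smith-Russell | Cosmetic |
    | 26381 | Michelle | Phelps |  | 1985-10-24 | 2552.72 | 2023-07-17 | Peck, Spence and Young | Travel |
    | 305449 | Jacob | Williams | M | 1981-10-25 | 115.97 | 2023-09-20 | Steele Inc | Clothing |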

    To learn more about this dataset, please visit this link.

  • Name
    telecom_customer_churn
    Description

    This dataset contains data on customer activities, preferences, and behaviors, which is useful for understanding customer engagement patterns, predicting churn, and analyzing customer feedback.

    | age | gender | security_no | region_category | membership_category | joining_date | joined_through_referral | referral_id | preferred_offer_types | medium_of_operation | internet_option | last_visit_time | days_since_last_login | avg_time_spent | avg_transaction_value | avg_frequency_login_days | points_in_wallet | used_special_discount | offer_application_preference | past_complaint | complaint_status | feedback | churn_risk_score |
    |---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
    | 18 | F | XW0DQ7H | Village | Platinum Membership | 17-08-2017 | No | xxxxxxxx | Gift Vouchers/Coupons | ? | Wi-Fi | 16:08:02 | 17 | 300.63 | 53005.25 | 17 | 781.75 | Yes | Yes | No | Not Applicable | Products always in Stock | 0 |
    | 32 | F | 5K0N3X1 | City | Premium Membership | 28-08-2017 | ? | CID21329 | Gift Vouchers/Coupons | Desktop | Mobile_Data | 12:38:13 | 16 | 306.34 | 12838.38 | 10 |  | Yes | No | Yes | Solved | Quality Customer Care | 0 |
    | 44 | F | 1F2TCL3 | Town | No Membership | 11-11-2016 | Yes | CID12313 | Gift Vouchers/Coupons | Desktop | Wi-Fi | 22:53:21 | 14 | 516.16 | 21027 | 22 | 500.69 | No | Yes | Yes | Solved in Follow-up | Poor Website | 1 |

    To learn more about this dataset, please visit this link.

  • Name
    house_sales
    Description

    This dataset contains data on houses sold over the years.

    | saledate | ma | type | bedrooms | created_at |
    |---|---|---|---|---|
    | 2007-09-30 | 441854 | house | 2 | 2007-02-02 15:41:51.922127 |
    | 2007-12-31 | 441854 | house | 2 | 2007-02-23 22:36:08.540248 |
    | 2008-03-31 | 441854 | house | 2 | 2007-02-25 19:23:52.585358 |
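
To inspect any of the tables listed above, a short preview query is usually enough. The sketch below builds such a query for a given table name. The helper `preview_query` and the constant names are hypothetical, and the schema name `demo_data` is taken from the connection parameters. The code only builds the SQL string; running it requires a PostgreSQL client of your choice.

```python
# Table names listed in this section.
SAMPLE_TABLES = (
    'fraud_detection',
    'customer_churn',
    'customer_support_chat',
    'bank_customer_transactions',
    'telecom_customer_churn',
    'house_sales',
)
# Schema name taken from the connection parameters.
SCHEMA = 'demo_data'


def preview_query(table, limit=3):
    """Return a SELECT statement that previews the first rows of a sample table.

    Hypothetical helper for illustration only.
    """
    # Accept only known table names so arbitrary input never reaches the SQL text.
    if table not in SAMPLE_TABLES:
        raise ValueError(f'unknown sample table: {table!r}')
    if not isinstance(limit, int) or limit < 1:
        raise ValueError('limit must be a positive integer')
    # Double quotes make PostgreSQL treat the identifiers literally.
    return f'SELECT * FROM "{SCHEMA}"."{table}" LIMIT {limit};'


for name in SAMPLE_TABLES:
    print(preview_query(name))
```

Some column names, such as Customer ID in bank_customer_transactions, contain spaces or uppercase letters and need double quotes when referenced individually in a query.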
