Articles

  • Aug 22, 2024 | medium.com | Siqi Li

    The AWS Cloud Development Kit (CDK) is a powerful tool for defining cloud infrastructure using familiar programming languages (Python in my case). However, like any tool, there are nuances, best practices, and potential pitfalls that become evident only through hands-on experience. In this article, I’ll share some practical notes and lessons learned from using AWS CDK in various real-world machine-learning projects.

  • Aug 8, 2024 | medium.com | Siqi Li

    For handling categorical features, if you are only familiar with one-hot encoding, now it is time to enrich your daily data scientist toolbox with another powerful encoding method: Frequency Encoding. It is especially useful when you are using tree-based models like Random Forest or Gradient Boosting, and dealing with high-cardinality features. When to Use Frequency Encoding: 1. High Cardinality Features. Definition: When a categorical feature has a large number of unique values.
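
    As a rough illustration of the idea in the snippet above (not code from the article itself), frequency encoding replaces each category with how often it occurs in the data. The sketch below uses only the Python standard library; the function name `frequency_encode`, the `normalize` flag, and the sample `cities` list are hypothetical names chosen for this example.

```python
from collections import Counter


def frequency_encode(values, normalize=True):
    """Replace each category with its frequency in `values`.

    Hypothetical helper for illustration only.
    Returns the encoded list and the category -> frequency mapping.
    """
    counts = Counter(values)  # occurrences of each category
    total = len(values)
    if normalize:
        # Relative frequency in [0, 1]
        mapping = {cat: n / total for cat, n in counts.items()}
    else:
        # Raw occurrence counts
        mapping = dict(counts)
    return [mapping[v] for v in values], mapping


# Made-up sample data: one categorical column
cities = ["Paris", "Lyon", "Paris", "Nice", "Paris", "Lyon"]
encoded, mapping = frequency_encode(cities)
```

    However many distinct categories there are, the result is a single numeric column, which is why the technique is often suggested for high-cardinality features. In practice the mapping would normally be fitted on training data only and then applied to unseen data, with some default for categories that were never seen.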

  • May 21, 2024 | mdpi.com | Xian Pan | Wang Dai | Zhenzhen Wang | Siqi Li

    All articles published by MDPI are made immediately available worldwide under an open access license. No special permission is required to reuse all or part of the article published by MDPI, including figures and tables. For articles published under an open access Creative Commons CC BY license, any part of the article may be reused without permission provided that the original article is clearly cited. For more information, please refer to https://www.mdpi.com/openaccess.
