What skills will be developed in the course?

The Complex Data Mining Extension Course (MDC) aims to train professionals for the current job market, with an emphasis on: (1) improving data management with speed, capacity and scalability in mind; (2) developing techniques for visualizing these data; (3) finding new business opportunities; (4) improving data analysis capacity; and (5) creating predictive models using the most modern machine learning methods.


Extension course

Email: mdc@ic.unicamp.br

Phone Number: (19) 3521-5883



The Complex Data Mining Extension Course (MDC) is made up of 9 subjects that teach the main concepts required by the job market, totaling 180 hours, with 144 hours of classes and 36 hours of supervised activities, fully online (via Zoom). Classes will be recorded and the videos will be available to students for up to 15 days after each class.

Digital Certificate

Students who pass the 9 subjects will be entitled to the certificate of the Extension Course in Complex Data Mining (MDC), issued by the Unicamp Extension School (see the Certificate Template).


The teaching staff of the Extension Course in Complex Data Mining (MDC) is made up of professors and researchers, all with doctorates, with extensive experience in the area.

  • Subjects

  • DATA ANALYSIS (INF-0612 - Messing with Data)
    Teacher: Zanoni Dias
    Introduction to Data Analysis using the R Language. Data types (vectors, lists, matrices, data frames, etc.). Predefined functions. Implementation of functions in R. Treatment, analysis and visualization of data.
    Classes: 20/07/2024, 27/07/2024, 03/08/2024 and 10/08/2024, from 08:30 am to 12:30 pm.

    INFORMATION RETRIEVAL (INF-0611 - Gathering Data)
    Teacher: Lin Tzy Li
    Introduction to information retrieval. Ranking evaluation techniques. Unstructured data retrieval concepts. Text retrieval. Content-based image retrieval. Video retrieval. Techniques for improving ranking quality.
    Classes: 20/07/2024, 27/07/2024, 03/08/2024 and 10/08/2024, from 1:30 pm to 5:30 pm.

    Teacher: Hélio Pedrini
    Knowledge discovery. Understanding and prospecting for information. Exploratory data analysis. Anomaly detection. Association rules. Dimensionality reduction. Attribute selection. Clustering techniques.
    Classes: 17/08/2024, 24/08/2024, 31/08/2024 and 07/09/2024, from 1:30 pm to 5:30 pm.

    SUPERVISED MACHINE LEARNING I (INF-0615 - Learning from Data)
    Teacher: Anderson de Rezende Rocha
    Classification problems. Decision boundaries. Linear and non-linear classifiers, logistic regression, decision trees and random forests. Overfitting and validation. Ensemble methods: bagging, boosting and stacking. Cross-validation. Class imbalance, diagnosis of bias and variance. Evaluation measures. Model interpretation (X-AI) and open-set classification.
    Classes: 17/08/2024, 24/08/2024, 31/08/2024 and 07/09/2024, from 08:30 am to 12:30 pm.

    INFORMATION VISUALIZATION (INF-0614 - Visualizing Data)
    Teacher: Celmar Guimarães da Silva
    Theoretical and practical aspects of Information Visualization (InfoVis). Representation of data in a graphic and interactive way. InfoVis reference model. Characterization of data. Recommendations for visual mapping. Visualization of multidimensional data. Visualization of texts.
    Classes: 14/09/2024, 21/09/2024, 28/09/2024 and 05/10/2024, from 08:30 am to 12:30 pm.

    SUPERVISED MACHINE LEARNING II (INF-0616 - Thinking with Data I)
    Teacher: Esther Luna Colombini
    Introduction to the Python language. Support Vector Machines (SVMs): kernels (linear and non-linear), SVRs and one-class SVM. Regularization techniques. Grid-search and random-search. Neural networks: types of networks, forward and backward propagation, and activation functions. Statistical tests.
    Classes: 14/09/2024, 21/09/2024, 28/09/2024 and 05/10/2024, from 1:30 pm to 5:30 pm.

    BIG DATA (INF-0617 - Big Data)
    Teacher: Lucas Francisco Wanner
    Introduction to parallel and distributed computing. Parallel data processing in Python. Distributed data processing with MapReduce and Hadoop Streaming. Introduction to tools for analyzing and processing data with Hadoop and Spark.
    Classes: 12/10/2024, 19/10/2024, 26/10/2024 and 02/11/2024, from 08:30 am to 12:30 pm.

    DEEP LEARNING (INF-0618 - Thinking with Data II)
    Teacher: Marcelo da Silva Reis
    Deep learning and convolutional neural networks (CNN). Convolution: padding and stride. Loss functions. Training: activation functions, pre-processing, data augmentation, weight initialization and parameter optimization. Regularization. Transfer learning. Recurrent Neural Networks (RNN). Transformers. Detection and Segmentation. Generative Adversarial Networks (GAN). Interpretability (X-AI). Tools: TensorFlow and Keras.
    Classes: 12/10/2024, 19/10/2024, 26/10/2024 and 02/11/2024, from 1:30 pm to 5:30 pm.

    FINAL PROJECT (INF-0619 - Data @ Work)
    Teacher: Zanoni Dias
    Definition of target problem. Data identification and collection. Analysis of the techniques to be employed. Comparative study. Analysis, visualization and presentation of results.
    Classes: 23/11/2024, 30/11/2024, 07/12/2024 and 14/12/2024, from 08:30 am to 12:30 pm.

  • 100% Online Course

  • All classes will be held and broadcast live (via Zoom), with real-time student participation, on the days and times indicated above. Classes will be recorded and the videos will be available to students for up to 15 days after each class. Course material (slides, tutorials, code, etc.) will be made available to students via Moodle. Teachers and teaching assistants will answer questions from Monday to Friday, both synchronously (via Zoom) and asynchronously (via Slack). Assessments will be carried out through practical assignments.

  • Registration

  • The following documents are required for registration:

    Registration Form and Term of Commitment signed digitally (documents generated by Online Pre-Registration)
    Diploma or Certificate of Completion of Undergraduate Course
    RG and CPF
    Cover letter (optional, free format, one page, attach to CV, to be sent through the system)

    Both sides of each document listed above must be submitted whenever there is any information on the back.


    Documents must be received through the Extecamp system by 30/06/2024 (Sunday).
    If you have questions about the registration documents, contact the Extension Secretariat (itext@unicamp.br).
    Late registration will not be accepted.

  • Investment

  • The total cost of the course (R$8.999,95) can be paid in 5 interest-free installments.

    Special discounts (cumulative):

    R$2.000,00 discount for payment in full (single installment).
    R$1.000,00 discount for payment in 3 interest-free installments.
    R$1.000,00 discount for former Unicamp students.
    R$1.000,00 discount for registrations made between 01/06/2024 and 16/06/2024. [promotion ended]
    R$2.000,00 discount for registrations made until 31/05/2024. [promotion ended]


    The discounts mentioned above will be applied manually when issuing bank slips, after the selection process (the system may display amounts without discounts at the time of registration).
    The payment of the first monthly installment or the single installment, depending on the payment method chosen, must be made by 10/07/2024.
    To qualify for the early registration discount, all documents must be delivered by the dates indicated.
    To qualify for the discount for Unicamp alumni, the candidate must present, at the time of registration, a diploma or a certificate of completion of an undergraduate or graduate course (master's or doctorate) issued by Unicamp.
    As the discounts are cumulative, it is possible to obtain a total discount of up to R$5.000,00 (combining the discounts listed above, subject to their respective conditions).
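
    The R$5.000,00 ceiling can be checked with a short sketch. It assumes, as an interpretation of the list above and not an official rule, that the two payment-method discounts cannot be combined with each other and that the two early-registration windows cannot be combined with each other. The dictionary keys and function names are hypothetical labels chosen for illustration.

```python
from itertools import combinations

# Discount amounts in reais, copied from the list above.
DISCOUNTS = {
    "single_payment": 2000,      # payment in full (single installment)
    "three_installments": 1000,  # payment in 3 interest-free installments
    "alumni": 1000,              # former Unicamp students
    "early_june": 1000,          # registration 01/06/2024 to 16/06/2024
    "early_may": 2000,           # registration until 31/05/2024
}

# Assumption: at most one discount from each group can apply,
# since a candidate has one payment method and one registration date.
EXCLUSIVE_GROUPS = [
    {"single_payment", "three_installments"},
    {"early_june", "early_may"},
]


def is_valid(combo):
    """True if the combination takes at most one discount per exclusive group."""
    chosen = set(combo)
    return all(len(group & chosen) <= 1 for group in EXCLUSIVE_GROUPS)


def max_discount():
    """Largest total discount over all valid combinations."""
    names = list(DISCOUNTS)
    best = 0
    for size in range(len(names) + 1):
        for combo in combinations(names, size):
            if is_valid(combo):
                best = max(best, sum(DISCOUNTS[name] for name in combo))
    return best


# Work in cents to avoid floating-point rounding on R$8.999,95.
FULL_PRICE_CENTS = 899_995
lowest_price_cents = FULL_PRICE_CENTS - max_discount() * 100

print(max_discount())  # 5000
print(f"{lowest_price_cents // 100}.{lowest_price_cents % 100:02d}")  # 3999.95
```

    Under these assumptions the best case is payment in full, alumni status and registration by 31/05/2024, which matches the R$5.000,00 maximum stated above.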

  • Details

  • Prerequisites: Completed undergraduate degree. Basic programming knowledge.
    Target Audience: Computing professionals with a degree in Computing or related areas (Engineering or Exact Sciences).
    Selection criteria: Review of CV and cover letter (optional).
    Course type: Extension course.
    Class schedules: Saturdays, from 8:30 am to 12:30 pm and from 1:30 pm to 5:30 pm.
    Required Material: As this is a course with a practical focus, all students must have a computer or laptop with internet access to follow the classes and the proposed practical activities.
    Class size: A minimum of 20 and a maximum of 90 students.
    Course coordinator: Zanoni Dias.

  • Calendar

  • Date                       Event
    02/05/2024 to 30/06/2024   Registration period
    30/06/2024                 Deadline for submission of registration documents
    05/07/2024                 Announcement of candidates selected for enrollment
    10/07/2024                 Due date of the first or single installment
    20/07/2024 to 14/12/2024   Course offering period