Assignment
Assignment
Whisper
Sarahah
X (Formerly Twitter)
Accepted Answers: Whisper Sarahah
Which of the following statements is/are true about ephemeral content on social
networks ?
Velocity
Veracity
Volume
Variety
Accepted Answers: Veracity
True
False
Accepted Answers: False
____ helps users on social media to find posts related to a specific topic by clicking or
searching.
Mentions
Hashtags
Shares
Likes
Accepted Answers: Hashtags
Interconnected Networks
numpy
requests
math
PIP
Virtual Environment
Allows for collaboration between various people working on the same project.
GitHub
GitLab
Bitbucket
git status
git log
Accepted Answers: git add --all
Creating a new file in a git repository adds it to :
Tracked Files
Untracked Files
Remote Repository
Select the correct syntax for pushing an existing repository to a remote repository on
GitHub
Which of the following is the correct syntax for installing a package Ubuntu computer ?
In which of the following formats does an API usually return its response?
JSON
XML
TSV
JSON
SQL
XML
True
False
Accepted Answers: True
Given a MongoDB collection named "words" with the following document structure:
{
"_id":ObjectId("615243d88f0ae43f255874c1"),
"count": 300,
"department": "the"
}
Which MongoDB query would you use to retrieve all words with count in descending order
?
db.words.find().sort({ count: -1 })
db.words.find().sort({ count: 1 })
db.words.find().order_by("count", DESCENDING}
db.words.find().sort({ descending: 1 })
Accepted Answers: db.words.find().sort({ count: -1 })
Vertices
Edges
Unique Identifier/ID
Cannot be Determined
Accepted Answers: False information/rumors spreads faster than facts/true information
on social media
Select the correct sequence of steps taken towards evaluating social media data.
Data Collection/Filtering > Data Characterization > Data Classification > Data
Evaluation
Data Collection/Filtering > Data Characterization > Data Evaluation > Data
Classification
Data Characterization > Data Collection/Filtering > Data Evaluation > Data
Classification
Data Characterization > Data Collection/Filtering > Data Classification > Data
Evaluation
Accepted Answers: Data Collection/Filtering > Data Characterization > Data
Classification > Data Evaluation
Which of the following attributes can be inferred from data collected via X (formerly
Twitter) ?
Which of the following processes can be used to receive annotated posts while
determining their legitimacy ?
TweetCred is a chrome extension that ranks the credibility of each tweet on Twitter.
True
False
Accepted Answers: True
Which Python library would you most likely use to get the top 10 hot posts from the
subreddit "r/politics" using the Reddit API?
praw
requests
urllib
json
Accepted Answers: praw
Karma
Comment
Upvote
Accepted Answers: Flair
Subreddits
Flairs
Threads
A higher Follower-Friend ratio on Twitter can be associated with the popularity of the
user
True
False
Accepted Answers: True
Select the correct sequence of steps as used by Facebook Inspector (as Discussed in
Lecture)
Data Collection > Ground Truth Extraction > Feature Generation > Supervised
Learning > Result via RESTful API
Feature Generation > Data Collection > Supervised Learning > Ground Truth
Extraction > Result via RESTful API
Ground Truth Extraction > Data Collection > Supervised Learning > Feature
Generation > Result via RESTful API
Data Collection > Feature Generation > Ground Truth Extraction > Supervised
Learning > Result via RESTful API
Accepted Answers: Data Collection > Ground Truth Extraction > Feature Generation >
Supervised Learning > Result via RESTful API
Which of the following is/are correct statements in context of Web of Trust Score?
Fundamentalist
Pragmatist
Unconcerned
"Geeta browses all social media websites on her laptop casually, but always opens her
banking website in incognito for checking her balance” According to Westin’s 3
Categories, Geeta is a :
Fundamentalist
Pragmatist
Unconcerned
According to Westin’s 3 Categories, the lowest percentage of US citizens fall into the
category of (as discussed in the lecture):
Fundamentalist
Pragmatist
Unconcerned
Accepted Answers: Unconcerned
According to Westin’s 3 Categories, the highest percentage of US citizens fall into the
category of (as discussed in the lecture) :
Fundamentalist
Pragmatist
Unconcerned
Accepted Answers: Pragmatist
Which of the following can be used in authentication for the Twitter API ?
Bearer Token
Client ID
Client Secret
CREATE TABLE (
tweet_id VARCHAR(50) NOT NULL,
text VARCHAR(500) NOT NULL,
PRIMARY KEY (tweet_id),
TABLE NAME(tweets)
);
CREATE TABLE tweets (
tweet_id VARCHAR(50),
text VARCHAR(500),
PRIMARY KEY (tweet_id),
NOT NULL (text)
);
Read through the report “Privacy in India: Attitudes and Awareness V 2.0”
at https://cdn.iiit.ac.in/cdn/precog.iiit.ac.in/research/privacyindia/
PI_2012_Complete_Report.pdf And answer the following questions [11-12] :
Which of the following method(s) were used for gathering information across India in the
report above ?
Surveys
Interviews
Surveys
Interviews
Database
Index
Document
Collection
Accepted Answers: Collection
A row in SQL is called a ______ in MongoDB.
Collection
Field
Index
Document
Accepted Answers: Document
Which of the following can be used to create a new database called “project” and create
a collection named “users” inside it in MongoDB ?
1. use project;
2. db.users.insert({});
1. use project;
2. db.users.insert();
1. create_db project;
2. db.users.insert({});
1. create project;
2. db.users.insert();
Accepted Answers: 1. use project; 2. db.users.insert({});
Information
Communication
Territorial
Bodily
Accepted Answers: Information Communication Territorial Bodily
Re-identification
Race
Age
Aadhaar Number
College Student ID
Accepted Answers: Aadhaar Number College Student ID
NumPy arrays are faster and more compact than Python lists.
NumPy uses much less memory to store data and it provides a mechanism of
specifying the data types.
import numpy as np
arr = np.array([1,2,3,4])
import numpy as np
arr = numpy.array([1,2,3,4])
import numpy as np
arr = np.array(1,2,3,4)
import numpy as np
a = np.array([range(i,i+3) for i in [1,2,3]])
print(a)
[[1 2 3]
[2 3 4]
[3 4 5]]
[[1 2 3]
[1 2 3]
[1 2 3]]
[[1 2 3]
[4 5 6]
[7 8 9]]
Which of the following codes can be used to make a NumPy array of 10 random numbers?
import numpy as np
import random
l = []
for i in range(10):
l.append(random.random())
zeros_array = np.array(l)
print(zeros_array)
import numpy as np
random_array = np.random.rand(10)
import numpy as np
random_array = np.random.random(10)
l = []
for i in range(10):
l.append(random.random())
zeros_array = np.array(l)
print(zeros_array)
import numpy as np
random_array = np.random.rand(10)
import numpy as np
random_array = np.random.random(10)
Which of the following NumPy functions can be used to generate an NxN diagonal
matrix ?
np.eye (5)
np.diagonal(np.eye (5))
np.diagonal (5)
Which of the following Python code can be used to read a CSV file using Pandas
import pandas as pd
data = pd.read_excel('file.csv')
import pandas as pd
data = pandas.readcsv('file.csv')
import pandas as pd
data = pd.read_csv('file.csv')
df['amount'].plot.hist()
df[df.columns[4]].plot.hist()
1 A
2 B
3 C
A 1
B 2
C 3
A B
C 1
2 3
Select the correct option for plotting the below function using matplotlib
plt.plot(np.cos(x))
plt.show()
plt.scatter(x, np.sin(x))
plt.show()
plt.scatter(np.cos(x), x)
plt.show()
#myNYPD
#myLAPD
#myDelhiPolice
User Handle
Profile Picture
Lexical Analysis
The data used to plot the above visualization is called Temporal Data
Temporal data
Spatial data
Linguistic data
None of the above
Accepted Answers: Temporal data
Consider the following tweet : “There is a huge pothole on the main road,Kindly repair it
Address:- Opposite Kachieguda police station, Hyderabad”. This is an example of ____
Actionable Information
Unactionable Information
Accepted Answers: Actionable Information
As discussed in the lectures, communication of the police with the citizens on Social
Media is ______
Formal
Informal
Accepted Answers: Formal
What is meant by an n-gram ?
n-gram
k-means clustering
LIWC
Topic Characteristics
Engagement Measures
Emotional Measures
Positive Emotions
Negative Emotions
Anxiety
Sadness
Accepted Answers: Positive Emotions Negative Emotions Anxiety Sadness
Select the correct option(s) below while analyzing a post on social media:
You receive an email from “Facebook Technical Support” from an unverified email, not
related to Facebook.com, asking you to reset your password via a suspicious form. This
is called ______.
Phishing
Clickbaiting
Compromised Account
While browsing a social media website, you come across the post shown above.
According to the lecture, the post falls under :
Hashtag Hijacking
Compromised Account
Clickbaiting
Whaling
Account Compromise
Hashtag Hijacking
Compromised Account
Spamdexing
Spamexing
Spamming
Spamming
Reciprocity
Link farming
Consider, all of them tweeted about #BLRrains expressing their happiness experiencing
rain.
According to the lecture, consider the indegree of these accounts and select which of
the following accounts are likely to rank higher while searching for #BLRrains on Twitter.
Anushka
Ramesh
Himanshu
Accepted Answers: Ramesh
Which of the following is/are the most widely used formats to represent node-edge graphs
on computers ?
Adjacency Matrix
CSV
GraphML
Directed Graph
Undirected Graph
Accepted Answers: Directed Graph
Consider the above image (Figure 2), and select the correct adjacency matrix
representation of the above shown undirected graph.
D
Accepted Answers: D
An NxN adjacency matrix implies there are ______ nodes in the graph.
N/2
N-1
2N-1
4
Accepted Answers: 2
Consider each node to be an account on Instagram in the graph above (Figure 4). Which
of the above nodes is/are the most influential ?
Node 1
Node 2
Node 4
Node 5
Accepted Answers: Node 4
Which of the following is/are graph centrality measures?
Node degree
Betweenness centrality
Closeness centrality
According to the study discussed in the lecture, which of the following users were most
likely to reciprocate links from spammers ?
Accepted Answers: People talking about Business, Internet Marketing, Money etc.
People promoting their own business or content or trends in a domain.
The above image from Gmail is an example of the ideas proposed by Facemail from MIT.
True
False
Observe the image below and answer the following questions [4-5] :
Which of the following nudge has been shown in the picture above ?
Sentiment Nudge
Timer Nudge
Picture Nudge
Making sure that you are sharing content with the intended audience/people.
Giving you the opportunity to cancel your post during a certain timeframe.
Making sure your posts are not perceived as negative by other users.
Accepted Answers: Making sure that you are sharing content with the intended
audience/people.
As discussed in the lecture, which of the following is/are not a social media nudge?
Picture Nudge
Timer Nudge
Voice Nudge
Video Nudge
The primary aim of nudges is to reduce user generated content on social media.
Accepted Answers: Interventions help users make better decisions about posting on
social media.
Nudges help individuals avoid regrettable online disclosures.
Hemant wanted to steal data from a rival business by going to their office and
transferring the required files by connecting a USB Flash Drive to the target computer.
This type of attack is called ________.
Physical Attack
Semantic Attack
Syntactic Attack
Physical Attacks
Semantic Attacks
Syntactic Attacks
Syntactic
Semantic
Physical
Vishing
Smsishing
['W', 'e', 'l', 'c', 'o', 'm', 'e', ' ', 't', 'o', ' ', 'P', 'S', 'O', 'S', 'M', '.']
apple
orange
yourself
most
L = {}
for i in tokens:
if i not in L:
L[i] = 1
else:
L[i] += 1
print(l)
Counter(tokens).count()
Accepted Answers:
L = {}
for i in tokens:
if i not in L:
L[i] = 1
else:
L[i] += 1
print(l)
from collections import Counter
print(Counter(tokens))
Which of the following refers to profile linking on social media as discussed in the
lectures ?
Accepted Answers: The ability to connect profiles of a user on different social networks.
Which of the following OSNs contain quality and descriptive information for the purposes
of profile linking, as discussed in the lecture ?
4Chan
Whisper
LinkedIn
The above X(formerly Twitter) profile helps in profile linking through self-identification.
True
False
Profile Picture
Username
Whisper
Yik Yak
Blind
Replies
Likes
Tweets
“If a whisper does not get attention shortly after posting, it is unlikely to get attention
later.” This statement is :
True
False
Observe the table given below and answer the following questions [11-13]
For the same number of Nodes/Users on Whisper, Facebook and Twitter, users on
Whisper create more connections/edges than on Facebook/Twitter.
True
False
False
True
False
Average Degree
Clustering Coefficient
Graph Density
Consider the above Degree Distribution graph generated in Gephi. Which of the following
statements is/are correct about this ?
Foursquare
Ola
Yelp
Uber
Accepted Answers: The person who checks in most frequently at a location in a given
period
Mayorships
Tips
Dones
Checkins
True
False
There is a high monotone increasing relation between the number of mayorships and
the number of tips.
Accepted Answers: There is a high monotone increasing relation between the number of
mayorships and the number of tips.
Look at the figure (Figure 2) given below for the Cumulative Distribution of Displacements
Between Consecutive Tips/Dones Posted per User:
About _____% of the users have spatial differences between their tips/dones at more than
or equal to 6,000km on average.
10
20
30
40
Accepted Answers: 10
Look at the (Figure 3) for the Cumulative Distribution of Time Interval Between
Consecutive Tips/Dones Posted per User :
50% of the users post tips/dones in intervals of less than 500 hours on average. This
statement is :
True
False
Scatter Plot
Bar Chart
Bubble Plot
Histogram
Bar Chart
Line Chart
Scatter Plot
Bubble Chart
Accepted Answers: Bar Chart Line Chart Scatter Plot Bubble Chart
To import highcharts into Python version >=3.10 use the following code:
bar
column
horizontal_bar
"legend": True,
Line 11
Line 12
Line 22
Line 26
Accepted Answers:
chart.add_data_set(data1, 'scatter', name = 'label_name')
chart.add_data_set(data1, 'scatter', 'label_name')
Which of the following social networks was NOT mentioned in the paper as a source for
the dataset ?
Foursquare
Google Plus
“Majority of the users on each of the above mentioned platforms provide home location
information at the street level.” This statement is:
True
False
Accepted Answers: False
Refer to the figure below (Figure 2) and answer the following question :
The distance between inferred and declared user home cities on Twitter is less than 40
km for ____ of users.
20 %
40 %
80 %
None
Accepted Answers: 20 %
Please go through the following paper, “On the dynamics of username change behavior
on Twitter” https://cdn.iiit.ac.in/cdn/precog.iiit.ac.in/Publications_files/04-Jain.pdf and
answer the following questions from [Q6-Q10]
1 point
From the following figure (Figure 3), what is the correct inference?
80% of users rarely change usernames and 20% change user names frequently.
20% of users frequently change usernames and 80% change usernames rarely.
The percentage of users attributed to the change of usernames does not follow a
Pareto Distribution.
Accepted Answers:
80% of users rarely change usernames and 20% change user names frequently.
20% of users frequently change usernames and 80% change usernames rarely.
The percentage of users attributed to the change of usernames follows a Pareto
Distribution.
Which of the following method(s) were used in the paper for measuring the similarity
between two consecutive usernames ?
Cosine Similarity
Most users added or deleted characters either from the start, middle or end of their
username.
Accepted Answers:
Most users added or deleted characters either from the start, middle or end of their
username.
Most users deleted characters at the end of their usernames.
Which of the following statements is/are true ?
Inactive users preventing others from using that specific username is called
Username Squatting.
Accepted Answers:
Inactive users preventing others from using that specific username is called Username
Squatting.
Refer to the figure below (Figure 5) about a partner account promoting another in a tweet
and answer the following question:
Which of the following phenomena is shown in the above figure as discussed in the
paper ?
Username Squatting
Adjust to Events
Space Gain
Please go through the following paper, “Boston Marathon Analyzing Fake Content on
Twitter” https://cdn.iiit.ac.in/cdn/precog.iiit.ac.in/Publications_files/ecrs2013_ag_hl_pk.pdf
and answer the following questions from [Q11-Q15]
1 point
Which of the following statements is true about Boston blasts?
Twin blasts occurred during the Boston Marathon on April 15th, 2013 at 18:50 GMT
Twin blasts occurred during the Boston Marathon on April 15th, 2009 at 18:50 GMT
Four people were killed and 264 were injured in the incident
Accepted Answers: Twin blasts occurred during the Boston Marathon on April 15th, 2013
at 18:50 GMT
What does the following figure depict (Figure 6) ?
The figure shows the temporal distribution of tweets after the Boston blast
The figure shows the spatial distribution of tweets after the Boston blast
Accepted Answers: The figure shows the temporal distribution of tweets after the Boston
blast
Refer to the table below (Table 1) about some parameters of Aditya’s account and
answer the following questions [Q13-15]
Calculate the GlobalEngagement for Aditya.
263.15
10.96
1.096
0.1
0.05
10
True
False
vim
neovim
vi
pip
npm
apt get
Anaconda
Club Penguin
Habbo
Value
Velocity
Volume
Veracity
Clickbaiting
True
False
Accepted Answers: 6
Which of the following is the correct command for initializing a git repository in a newly
created empty folder ?
git initialize
git create
git initiate
git init
Accepted Answers: git init
Latanya Sweeney's research highlighted how two seemingly unrelated pieces of
information can be linked to reidentify an individual. Which term best describes this
concept?
Data Anonymization
Data Aggregation
Reidentification
X (formerly Twitter)
YouTube
stopwords = ["i", "me", "my", "myself", "we", "our", "ours", "ourselves", "you", "your",
"yours", "yourself", "yourselves", "he", "him", "his", "himself", "she", "her", "hers",
"herself", "it", "its", "had", "having", "do", "does", "did", "doing", “mango”, "a", "an", "the",
"and", "but", "if", "or", "because", "as", "until", "while", "of", "at", "by", "for", "with",
"about", "against", "between", "into",”now"]
mango, now
mango